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Abstract. For every integer fc ^ 2 let be the set of all words over k, that 

is, all finite sequences having values in [k] := {f,...,fc}. A Carlson-Simpson 
tree of [/c]<** of dimension m > 1 is a subset of [A;]^'^ of the form 

{ui} U {w'^wo{ao)'^ ...'^Wn{an) ■ n e {0, m — 1} and ao, an e [fc]} 

where is a word over k and {wn Y^Zo ^ finite sequence of left variable words 
over k. We study the behavior of a family of measurable events in a probability 
space indexed by the elements of a Carlson-Simpson tree of sufficiently large 
dimension. Specifically we show the following. 

For every integer k ^ 2, every < e ^ 1 and every integer n ^ 1 there exists 
a strictly positive constant 6{k,e,n) with the following property. If m is a 
given positive integer, then there exists an integer Cor(fc,£,m) such that for 
every Carlson-Simpson tree T o/[A;]^^ of dimension at least Cor(fc,e,m) and 
every family {At : t £ T} of measurable events in a probability space (Q, S, /x) 
satisfying tJ.{At) ^ e for every t aT , there exists a Carlson-Simpson tree S of 
dimension m with S C T and such that for every nonempty F C S we have 

n At)^e(k,s,\F\). 

The proof is based, among others, on the density version of the Carlson- 
Simpson Theorem established recently by the authors, as well as, on a refine- 
ment - of independent interest — of a partition result due to H. Furstenberg 
and Y. Katznelson. The argument is effective and yields explicit lower bounds 
for the constants 6(k,e,n). 



1. Introduction 

1.1. Overview. The present paper ~ which is the sequel to [U [6] - is devoted to 
the study of the structure of a family of measurable events in a probability space 
indexed by a Ramsey space [3j. The most classical and illuminating case is when 
the events are indexed by the natural numbers. Specifically, let (fi, E,/x) be a 
probability space and {Ai : i G N} be a family of measurable events in (f2,S,/i) 
satisfying ^{Ai) ^ e > for every i € N. Using Ramsey's Theorem [21] and 
elementary probabilistic estimates, for every < 6 < e wc may select an infinite 
subset L of N such that for every integer n ^ I and every subset of L of cardinality 
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n we have 

(1.1) /x(fi^,)^r. 

In other words, the events in the family {Ai : i e L} are at least as correlated as if 
they were independent. 

Now suppose that the events are indexed by another Ramsey space S. A nat- 
ural problem is to decide whether the aforementioned result is valid in the new 
setting. Namely, given a family {Ag : s G 5} of measurable events in a probability 
space (0,E,/z) satisfying fi{As) ^ e > for every s S <S, is it possible to find a 
"substructure" S' of S such that the events in the family {A^ : s G S'} are highly 
correlated? And if yes, then one would like to get explicit (and, hopefully, optimal) 
lower bounds for their joint probability. 

In all cases of interest, this problem is essentially equivalent to that of finding 
"copies" of given configurations inside dense sets of discrete structures, a theme of 
fundamental importance in Ramsey Theory. The equivalence between the two per- 
spectives is discussed in detail in [7, §8.1] and is based on the "regularity method", 
a remarkable discovery of E. Szemeredi [24 asserting that dense sets of discrete 
structures are inherently pseudorandom. 

1.2. The main result. Our goal in this paper is to study the above problem when 
the events are indexed by words; recall that, for a given integer fc 5^ 2, a word over 
k is just a finite sequence having values in [k] :— {1, fc}. The set of all words 
over k is denoted by [A:]^^. 

In this context the most natural and fruitful notion of "substructure" is that of 
a Carlson-Simpson tree [11 [7]. To recall the definition we need, first, to introduce 
some pieces of notation and some terminology. Specifically, let k ^ 2 and fix a letter 
X that we regard as a variable. A variable word over fc is a finite sequence having 
values in [fc] U {a;} where the letter x appears at least once, while a left variable 
word over fc is a variable word over fc whose leftmost letter is the variable x. If w is 
a variable word and a e [fc], then w{a) is the word over fc obtained by substituting 
all appearances of the letter a; in w by a. The concatenation of two words u and v 
over fc is denoted by u'~'v. 

Definition 1.1. Let fc G N with fc ^ 2. A Carlson-Simpson sequence over k is a 
finite sequence (w,Wo, w,„_i) where m is a positive integer, w is a word over fc 
and Wq, Wm-i o.Te left variable words over fc. A subset of [k]"^^ of the form 

(1.2) {w} U {w"^wo{aQ)'" ...""Wn{an) : n G {0, m — 1} and aq, a„ G [fc] } 

where (w, wq, Wm-i) is a Carlson-Simpson sequence over k, will be called a 
Carlson-Simpson tree of [fc]<^. 
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It is easy to see that the Carlson-SnTipson sequence (w, wq, ...,Wm-i) that gen- 
erates a Carlson-Simpson tree T via formula ()1.2|) is unique. The corresponding 
positive integer m will be called the dimension of T and will be denoted by dim(r). 

We are ready to state the first main result of the paper. 

Theorem 1.2. For every integer k ^ 2, every < e ^ 1 and every integer n ^ 1 
there exists a strictly positive constant 6{k,e,n) with the following property. If m 
is a given positive integer, then there exists an integer Cor(fc, e, m) such that for 
every Carlson-Simpson tree T of[k]^^ of dimension at least Cor(fc,£,m) and every 
family {At : t G T} of measurable events in a probability space {fl, S, fi) satisfying 
/i(y4.f) ^ e for every t G T , there exists a Carlson-Simpson tree S of dimension m 
with S C_T and such that for every nonempty F <Z S we have 



Of course, the main point is that, for fixed parameters k and e, the lower bound 
on the joint probability of the events {At : t G F} given in ()1.3|) depends only on the 
cardinality of the set F and not on the dimension of the Carlson-Simpson tree S. 
The argument is effective and yields explicit estimates for the constants 9(k,e,n). 
These estimates are admittedly rather weak and it is an important problem to 
obtain "civilized" bounds. We point out, however, that if we restrict our attention 
to a certain class of subsets of Carlson-Simpson trees, then we get optimal lower 
bounds. This is the content of §10 and §11 in the main text. One of the consequences 
of our analysis is that the constant 9{k,e,2) can be chosen to be — o(l), an 
estimate which is clearly sharp. 

1.3. On the proof of Theorem 11.21 The first basic ingredient of the proof of 
Theorem 11.21 is the density version of the Carlson-Simpson Theorem established, 
recently, in [7 . The second one is a refinement - of independent interest - of a 
partition result due to H. Furstenberg and Y. Katznelson. 

Before we state this refinement, let us start with a brief motivating discussion. 
Suppose that we color the set of all pairs of, say, [2]<^. Is it then possible to find 
a Carlson-Simpson tree of [2]^^ of large dimension all of whose pairs are of the 
same color? This natural Ramsey-type problem is easily seen to have a negative 
answer. Indeed, color red all pairs which are of the form {w{l)'^u{2),w{2)'~'u{l)} 
where w and u are variable words over 2; color the remaining pairs blue. Clearly, 
every Carlson-Simpson tree of [2] <^ of dimension at least 2 contains pairs of both 
colors. 

In spite of the existence of pathological colorings, there is non-trivial information 
on the aforementioned problem. The central idea - which has proven to be highly 
valuable in related parts of Ramsey Theory; see, e.g., [2j [12] - is to categorize all 
pairs of [2]^'*' (and, more generally, all subsets of [k]'^^ of a given cardinality) in a 



(1.3) 
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list of classes each of which has the Ramsey property. This can be done with the 
help of the notion of typcQ, introduced in [10, §2], which we are about to recall. 

Let k,q ^ N with k ^ 2 and q ^ I. Also let £ be a set of cardinality q which 
is disjoint from [k] and denote by W(A;,£) the set of all words over [k] U C, that 
is, all finite sequence having values in [k] U C. A type of W(fc, £) is a nonempty 
finite sequence in £ having no consecutive multiple appearances of the same letter. 
For instance, if £ = {Ai, A2}, then (Ai, A2, Ai) is a type but (Ai, Ai, A2) is not. For 
every w G W(fc, £) \ [/c]<^ we assign its type as follows. First we erase all letters 
of w which belong to [k], then we shorten the runs of the same letters of £ to 
singletons and, finally, we push everything back together. For example, the type of 
(1, Ai, 2, Ai, 5, A2, 7, A2, Ai, 8, A3) is the word (Ai, A2, Ai, A3). 

Now let n be a positive integer and w = (w, wq, Wn-i) be a Carlson-Simpson 
sequence over k. Also let v = (aq, 0^-1) G W(fc, £) be a word of length at most 
n. We set 

(1.4) w{v) ^ w^wo{ao)^wi{ai)^ ...^Wm~i{am~i) 

with the convention that w(w) = w if u is the empty word. Finally let 

(1.5) w(fc,£) = {w(i;) : v £ W(fc, £) is of length at most n}. 

We are ready to state the second main result of the paper. It is a refinement of 
[lOl Theorem 2.7]. 

Theorem 1.3. Let k,q,d,r e N with k ^ 2 and q,d,r ^ 1. Then there ex- 
ists a positive integer N with the following property. If n ^ N and C is a set 
of cardinality q which is disjoint from [k], then for every Carlson-Simpson se- 
quence w = {w,wo, ...,Wn-i) over k and every r-coloring of w(A;,£) there exists a 
Carlson-Simpson subsequence v — {v,Vo, ...,Vd-i) of w such that for every type r 
of W(fc, £) the set of all words in v(fc, £) of type r is either empty or monochro- 
matic. The least integer N with this property will be denoted by FK(fc, q, d, r). 

Moreover, there exists a primitive recursive function iy9 : N** — > N belonging to 
the class £^ of Crzegorczyk's hierarchy such that 

(1.6) FK{k,q,d,r) ^ip{k,q,d,r) 

for every fc, g, d, r G N with k ^ 2 and q,d,r ^ 1. 

We remark that a Carlson-Simpson subsequence of w is just a Carlson-Simpson 
sequence v which is obtained by successively "blocking" consecutive members of w 
(the precise definition is given in §2.3). 



^We remark that the term "type" is not used in |10| . Actually, this notion is not named at all 
neither in [10] , nor in other places in the literature. 
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1.4. Structure of the paper. The paper is organized as follows. In §2 we set 
up our notation and terminology and we gather some background material needed 
in the rest of the paper. In §3 we give the proof of Theorem 11.31 As we have 
already pointed out, Theorem 11.31 is the main tool for the analysis of the Ramsey 
properties of various families of subsets of Carlson-Simpson trees. This analysis is 
of independent interest and is carried out in §4, §5 and §6. The reader will find in 
§4.1 a discussion on the content of these sections. 

The next three sections are devoted to the proof of Theorem 1 1.21 The main bulk 
of the argument is contained in §7 and is heavily based on the material developed 
in the previous sections. The last step is given in §8. The proof of Theorem 11.21 is 
then completed in §9. In the next two sections, §10 and §11, we discuss quantitative 
refinements of Theorem 11.21 Finally, in §12 we make some comments. 

The paper ends with two appendices. They contain some results needed for the 
proof of Theorem 11.31 While most of these results are well-known, they are pre- 
sented in some detail for the benefit of the reader and for completeness. Emphasis 
is given on quantitative aspects of proofs. 

2. Background material 

By N = {0, 1, 2, ...} we shall denote the natural numbers. For every integer n ^ 1 
we set [n] ~ {1, ...,n}. If X is a nonempty finite set, then by ExeJf we shall denote 
the average ^xex where \X \ stands for the cardinality of X . For every function 
/ : N ^ N and every ^ e N by : N N we shall denote the ^-th iteration of / 
defined recursively by (n) — n and /^^^^''{n) = f(^f'^^\n)) for every n G N. 

2.1. Words over a finite alphabet. Let ^4 be a finite alphabet, i.e., a nonempty 
finite set. For every n g N let A" be the set of all sequences of length n having 
values in A. Precisely, contains just the empty sequence while if n ^ 1, then 

(2.1) A" = {(ao,...,a„_i) : a^ ^ A for every i G {0, n - 1}}. 
Also let 

n 

(2.2) A<"+i = U 
and 

(2.3) A<^^ = y A". 

The elements of A"^^ are called words over A, or simply words if A is understood. 
The length of a word w over A, denoted by is defined to be the unique natural 
number n such that w € A". For every « S N with i ^ \w\ by w\i we shall denote 
the word of length i which is an initial segment of w. More generally, if is a 
nonempty subset of such that for every w W we have \w\ ^ i, then we set 

(2.4) W\i^{w\i:w€W}. 
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The concatenation of two words wi and W2 over A will be denoted by w^W2- 
Moreover, for every pair Wi and W2 of nonempty subsets of we set 

(2.5) W^W2 = {w'^W2 : wi e VFi and W2 G W2}. 

If w\ and W2 are two words over then their infimum is the greatest common 
initial segment of wi and W2 ■ It will be denoted by wi A W2 . The infimum operation 
can be naturally extended to nonempty subsets of words. Specifically, let W he a 
nonempty subset of A"^^. The infimum of W, denoted by AW, is the word over 
A of greatest length which is an initial segment of every w G W. Notice that 
wi Aw2 = A{wi,W2} for every wi,W2 G A"^^ . 

Finally let A: G N with fc ^ 2. A special case - which is, nevertheless, of particular 
importance - of an alphabet of cardinality k is the set [k] . The elements of [k] 
will be called words over k. Of course, if A is another alphabet with \A\ = k, then 
the sets A'^^ and [k] <^ are naturally "isomorphic" . We need to consider words over 
arbitrary alphabets for reasons that will become clear in the sequel. 

2.2. Variable words. Let Abe a, finite alphabet and n be a positive integer. Fix 
a set {xq, Xn-i} of distinct letters which is disjoint from A. We view the set 
{xq, ...,Xn-i} as a set of variables. An n-variable word over A is a finite sequence 
having values in A U {xq, Xn-i} such that: (a) for every i G {0, n — 1} the 
letter Xi appears at least once, and (b) if n ^ 2, then for every i,j G {0, ...,n — 1} 
with i < j all occurrences of Xi precede all occurrences of Xj. If A is understood, 
then n-variable words over A will be referred to simply as n-variable words while 
1-variable words over A will be referred to as variable words. A left variable word 
is a variable word whose leftmost letter is the variable x. 

Remark 2.1. The concept of an n-variable word over A is closely related to the 
notion of an n-parameter word over A introduced by R. L. Graham and B. L. 
Rothschild in [13|. Indeed, recall that an n-parameter word over A is also a finite 
sequence having values in AU {xq, Xn-i} satisfying condition (a) above and such 
that: (b') if n ^ 2, then for every i, j G {0, n — 1} with i < j the first occurrence 
of Xi precedes the first occurrence of Xj. In particular, every n-variable word is an 
n-parameter word. Of course, when "n = 1" the two notions coincide. 

As above, let A be a finite alphabet. For every n-variable word w over A and 
every /3o, ...,/3„_i G AU{xo, ...,Xn-i} by wdSo, ...,/3„_i) we shall denote the unique 
word over [k] U {xq, ...,x„_i} obtained by substituting in w all appearances of the 
letter Xi with (3i for every i G {0, ...,n — 1}. Notice that (/3o, /3n-i) G A" if 
and only if w(f3a, f3n-i) is a word over A. More generally, if m G [n,], then 
(/3o, ■■■,/3n-i) is an m-variable word if and only if w(/3o, ■■■,l3n-i) is an m-variable 
word. An m-variable word of the form w{I3q, ...,/3„_i) will be called an m-variable 
subword of w. 
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We recall some basic combinatorial results concerning words. The first one is 
due to A. H. Hales and R. I. Jewett fTS] . 

Theorem 2.1. For every k,r £ N with fc ^ 2 and r ^ 1 there exists a positive 
integer N with the following property. If n ^ N , then for every alphabet A with 
\A\ = k and every r-coloring of A" there exists a variable word w of length n 
such that the set {u){a) : a € A} is monochromatic. The least integer N with this 
property will be denoted by }13{k,r). 

The Hales-Jewett Theorem is one of the cornerstones of modern Ramsey Theory. 
The best known upper bounds for the numbers HJ(A:,r) are due to S. Shelah. 
Specifically, by Theorem 1.5], there is a primitive recursive function </) : — )■ N 
belonging to the class £^ of Grzegorczyk's hierarchy such that for every integer k ^ 2 
and every integer r ^ 1 we have that HJ(fc, r) ^ (f>(k, r). 

We will also need the following theorem. 

Theorem 2.2. Let k, d, m, r be positive integers with k ^ 2 and d ^ m. Then 
there exists a positive integer N with following property. If n ^ N , then for every 
alphabet A with \A\ = k and every r-coloring of the set of all m-variable words 
over A of length n, there exists a d-variable word vu over A of length n such that 
all m-variable subwords of w are monochromatic. The least integer N with this 
property will be denoted by GR(fc,d, TO,r). 

Theorem 12.21 is a variant of the Graham-Rothschild Theorem [T3] which refers 
to rn-parameter words instead of m-variable words. It follows from the work of S. 
Shelah - see [531 page 687] - that the original "Graham-Rothschild numbers" are 
bounded by a primitive recursive function belonging to the class £^ of Grzegorczyk's 
hierarchy. On the other hand there are several detailed expositions as well as infinite 
extensions of Theorem 12.21 - see, e.g., [U [lOl [17]. While it is possible to extract 
explicit upper bounds for the numbers GR(A;, d, m, r) following known arguments, 
such a statement is not available in the literature. For completeness we carry out 
such an analysis in Appendix A. 

2.3. Carlson— Simpson sequences. Let /c G N with k ^ 2. As we have already 
mentioned in Definition II. li a Carlson-Simpson sequence over fc is a nonempty 
finite sequence of the form 

(2.6) w = (w, Wo, ...,i«„-i) 

where n is a positive integer, w is a word over k and wq, ...,Wn-i are left variable 
words over fc. Of course, Carlson-Simpson sequences over fc are in one-to-one 
correspondence with Carlson-Simpson trees of [fc]^^. In several cases, however, it 
is very convenient to work with Carlson-Simpson sequences, and as such, we find 
it appropriate to explicitly isolate this concept. 
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Let w — {wjWq, ...,Wn-i) be a Carlson-Simpson sequence over k. The integer 
n will be called the dimension of w and will be denoted by dim(w). Also let 
m G [n\. A Carlson-Simpson sequence v — (u, uq, Wm-i) over k will be called 
an m- dimensional Carlson-Simpson subsequence of w if there exist a sequence 
{aiYlZ^ in [k] U {x} and a strictly increasing sequence (n^)™ q in {0, n} such that 
the following conditions are satisfied. 

(CI) For every i g {0, m — 1} we have a„; — x. 

(C2) If riQ = 0, then v — w. Otherwise, we have that oq, flno-i G [k] and 

(2.7) V = w'"WQ{af)y ..rwno-i{ana-i)- 
(C3) For every i G {0, to — 1} we have 

(2.8) Vi{x) ^WnMn,yWni+l[an, + iy...^Wni+i~l[an,+^^l). 

The set of all m-dimensional Carlson-Simpson subsequences of w will be denoted 
by Subseq„(w). 

We will need the following theorem. It is a reformulation of "ll Theorem 4.1]. 

Theorem 2.3. Let k,d,m,r he positive integers with k ^ 2 and d ^ to. Then 
there exists a positive integer N with the following property. If n ^ N , then for 
every n-dimensional Carlson-Simpson sequence w over k and every r-coloring of 
Subseq^(w) there exists v G Subseq(;(w) such that the set Subseq„(v) is monochro- 
matic. The least integer N with this property will be denoted by CS(fc, d, to, r). 

The proof of Theorem 12.31 given in [7] yields upper bounds for the numbers 
CS{k,d,m,r) which are expressed in terms of the numbers GR(fc, d, to, r). We 
shall present a different proof in Appendix B which makes this dependence more 
transparent. The argument, combined with the results in Appendix A, yields in 
particular that the numbers CS{k,d,m,r) are bounded by a primitive recursive 
function belonging to the class £^ of Grzegorczyk's hierarchy. 

2.4. Carlson— Simpson trees. Recall that a Carlson-Simpson tree of [k]'^^ is a 
set of the form 

(2.9) {w} U {u''"wo(ao)'^...'"w,i(a„) : n G {0, to — 1} and qq, a„ G [k] } 

where {w,Wo, ...,Wm-i) is a Carlson-Simpson sequence over k. Observe that the 
Carlson-Simpson sequence {w, wq, Wm-i) that generates a Carlson-Simpson tree 
W via formula (j2.9p is unique. It will be called the generating sequence of W. Also 
recall that the corresponding natural number to is called the dimension of W and 
is denoted by dim(W^). The 1-dimensional Carlson-Simpson trees will be called 
Carlson-Simpson lines. 

Let W be an TO-dimensional Carlson-Simpson tree of [k]"^^ and {w, wq, Wm-i) 
be its generating sequence. For every n G [to] the n-level W{n) of W is defined by 

(2.10) W{n) = {w'"wo(ao)'"...'"wn_i(a„_i) : ao,...,a„_i G [k]}. 
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The 0-level W{0) of W is defined to be the smgleton {w}. 

For every m-dimensional Carlson-Simpson tree W of [k]^^ and every i G [m] 
by Subtr£(Ty) we shall denote the set of all ^-dimensional Carlson-Simpson trees 
of [k]"^^ which are contained in W. An element of Subtr£(H^) will be called an 
I -dimensional Carlson-Simpson subtree of W . The set Subtr^(VF) is in one-to-one 
correspondence with the set Subseq£(w) where w stands for the generating sequence 
of W . Indeed, notice that for every Carlson-Simpson tree V of [fej^*^ generated by 
the sequence v we have that V £ Subtr^ (Vl^) if and only if v e Subseq£(w). 

A natural example of a Carlson-Simpson tree of [k]'^^ of dimension m is the 
set [A:]^"'"'""'^. Actually, every Carlson-Simpson tree of dimension m can be thought 
of as a "copy" of [fc]<'"+^ inside [fc]^'*. Specifically, let W be an m-dimensional 
Carlson-Simpson tree of [fc]^^ and (w, wq, Wm-i) be its generating sequence. 
The canonical isomorphism associated to W is the bijection Iw ■ [fc]<™+^ W 
defined by 1iy{0) = w and 

(2.11) lw{{ao,...,an-i)) = w'^woiao)^ ..rwn^i{an-.i) 

for every n G [m] and every (oq, ...,a„_i) e [fc]". The canonical isomorphism Iw 
preserves all structural properties one is interested in while working in the category 
of Carlson-Simpson trees. Precisely, we have the following. 

Fact 2.4. Let W be an m-dimensional Carlson-Simpson tree of [fc]^^. Then the 
following are satisfied. 

(a) If i G [m] and V is a Carlson-Simpson subtree of [k]"^™^^ of dimension i, 
then its image lw{V) under the canonical isomorphism is an i-dimensional 
Carlson-Simpson subtree ofW. 

(b) For every nonempty subset F of [fc]<™+-'^ we have In/(AF) — /\lw{F). In 
particular, Carlson-Simpson trees preserve infima. 

By Fact 12. 4( for most practical purposes we may identify an m-dimensional 
Carlson-Simpson tree W with [fc]^'"^^ via the canonical isomorphism \w- 
We close this subsection with the following consequence of Theorem 12.31 



Lemma 2.5. Let k,d,r be positive integers with k ^ 2. IfW is a Carlson-Simpson 
tree W of [k]<^' with 



then for every r -coloring ofW there exists V G Subtr(i(Vl^) which is monochromatic. 

Proof. Let w be the generating sequence of W and fix an r-coloring c : — ?> [r]. 
Define c : Subseqi(w) — [r] by the rule 



for every {v,vq) G Subseq]^(w). Invoking Theorem 12.31 and (|2.12p . we may select 
V = {v,vq, ...,Vd) G Subseqj;_|_]^(vif) such that the set Subseq]^(v) is monochromatic 



(2.12) 



dim(VK) ^ CS(fc,d + l,l,r), 



(2.13) 



c{{v,vo)) = c{v) 
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with respect to the coloring c. Let V be the rf-dimensional Carlson-Simpson tree 
generated by the sequence {v,vq, Wd-i). It is easy to see that V is as desired. □ 

2.5. Probabilistic preliminaries. We recall the definition of a class of probabil- 
ity measures on [fc]*^^ introduced by H. Furstenberg and B. Weiss in [11]. Specif- 
ically, let k,m eN with fc ^ 2 and m ^ 1. The Furstenberg-Weiss measure d™^ 
associated to [fc]^'"^^ is the probability measure on defined by 

(2.14) d?W(^)-E„e{o,...,™}^^^- 
We will also need the following standard lemma. 

Lemma 2.6. Let < < e ^ 1 and n E N with n ^ (e^ _ ■d'^)-'^ . If {A,)^^^ is a 
family of measurable events in a probability space (fi, E, /i) satisfying iJ-{Ai) ^ e for 
every i G [n], then there exist i,j G [n] with i ^ j such that ii{Ai Ci Aj) ^ -d^ . 

Proof. We set X = J27=i '^^i where is the indicator function of the event Ai 
for every i £ [n]. Then K[X] ^ en so, by convexity, 

(2.15) ti{A,r\Aj)^E[X{X-l)]^en{en~l). 

i£ln]je[n]\{i} 

Therefore, there exist i,j G [n] with i ^ j such that fi{Ai O Aj) ^ 6'^ . □ 

2.6. The density Carlson— Simpson Theorem. We will need the following re- 
sult (see [7, Theorem B]). 

Theorem 2.7. For every integer k ^ 2, every integer m ^ 1 and every < 6 ^ 1 
there exists an integer N with the following property. If L is a finite subset ofN of 
cardinality at least N and A is a set of words over k satisfying \ A D [k]"\ ^ 6k"' for 
every n E L, then there exists a Carlson-Simpson sequence {w,wo, ...Wm~i) over k 
such that the set 

(2.16) {w} U {w'" WQ{aa)"^ . . Wn{an) : n E {0, m — 1} and Og, a„ E [/c]} 

is contained in A. The least integer N with this property will be denoted by 
DCS(fc,m,(5). 

Theorem 12.71 is the density version of a well-known coloring result due to T. J. 
Carlson and S. J. Simpson [4]. Also we notice that the argument in [7^ is effective 
and gives explicit upper bounds for the numbers DCS(fc, to, 6). These upper bounds, 
however, have an Ackermann-type dependence with respect to k. 

The final result of this subsection is a consequence of Theorem 12.71 To state it 
we need, first, to introduce some numerical invariants. Specifically, for every integer 
k ^ 2 and every < S ^ 1 we set 

(2.17) A = A(fc, S) = \Ad-^T)CS{k, 1, (5/4)] 
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and 



We have the following lemma. 

Lemma 2.8. Let k E N with k ^ 2 and < S ^ 1, and define A — A{k,5) 
as in p.l7p . Also let {At : t £ [k]'"^} be a family of measurable events in a 
probability space (f2,I],/i) satisfying fJ-{At) ^ 6 for every t € [k]<^. Then there 
exists a Carlson-Simpson line S of [fc]^^ such that 

(2.19) Jf]At)^rjik,S) 



tes 



where r](k,S) is as in (j2.18p . 



We remark that Lemma 12.81 follows from [7, Lemma 7.9]. It is based on an 
argument that can be traced in an old paper of P. Erdos and A. Hajnal [S]. For 
the convenience of the reader we include the proof. 

Proof of Lemma \2.8[ Let dp^^ be the Furstenberg- Weiss measure associated to 
[k]<^ and set 

(2.20) y = e f7 : d^y^\{t E [k]<^ : lo e At}) ^ ^2}. 

Since ^(^t) ^ ^ for every t € [k]<^ we have ii{Y) > S/2. 

Let w G y be arbitrary and set A^ — {t £ [k]'^-^ : uj E At} . Observe that 

(2.21) |{n£{0,...,A-l}: ' " ^ ' ^ ^/4}| ^ (J/4)A ^ DCS(fc, 1, <S/4). 

By Theorem 12.71 there exists a Carlson-Simpson line of [k] with Suj C Au, . 
In particular, we have 5*^^ C [k]"^^ and 

(2.22) OJE f] At. 

By the classical pigeonhole principle, there exist Z E T. and a Carlson-Simpson line 
S of [k] <^ such that Si^ = S for every ui E Z and 

^-oo-iA m(^) ^ S/2 ini 

^^^^ ^ |Subtr,([fc]<A)| ^ |Subtr,([fc]<A)| = '^(^''^^^ 

Hence, 



(2.24) /^(fl^O ^^(^) ^ '?(^''^) 

and the proof is completed. □ 
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3. Proof of Theorem 11.31 

This section is devoted to the proof of Theorem 11.31 stated in the introduction. 
It is organized as follows. In §3.1 we introduce some pieces of notation and isolate 
some basic properties of types, while in §3.2 we gather some preliminary tools. The 
proof of Theorem 11.31 is completed in §3.3. 

3.1. Types: definitions and basic properties. Let fc, g G N with fc ^ 2 and 
q ^ 1. Also let C be an alphabet of cardinality q which is disjoint from [k] and 
denote by W(fc, C) the set of all words over [k] U C Recall that a type of W(fc, jC) is 
a nonempty word over C having no consecutive multiple appearances of the same 
letter. The set of all types of W(fc, C) will be denoted by T{L). Also recall that for 
every w G W(fc,£) \ [/cj^^ its type is defined as follows. First we erase all letters 
of w which belong to [k], then we shorten the runs of the same letters of C to 
singletons and, finally, we push everything back together. For every r G T{C) by 
W(fc, £, r) we shall denote the set of all w G W(A:, C) of type r. Hence, 

(3.1) W(fc,£)\[A:]<«= y W(fc,/:,T). 

rer(£) 

Let w = (w, wq, w„_i) be an n-dimensional Carlson-Simpson sequence over k. 
For every v — (ao, flm-i) £ W(fc, C) of length at most n we set 

(3.2) •w{v) ^w'^wo[aiiywi[aiy..rwm-i{am-i) 

with the convention that w(u) = w if w is the empty word. We define 

(3.3) •w{k,C) — {w(t;) : v G W(fc, £) of length at most dim(w)}. 
Moreover, for every t G T{C) we set 

(3.4) w(fc, £, t) ^ w(fc, C) n W(fc, C, t). 
We will need the following elementary fact. 

Fact 3.1. Let w be an n-dimensional Carls on- Simp son sequence over k. 

(a) For every r G T{C) the set w(k,C,T) is nonempty if and only if \t\ ^ n. 
Moreover, if W is the Carlson-Simpson tree generated by w, then 

(3.5) w(fc,£)\T4^== IJ w{k,C,T). 

{Ter(£):|r|sSn} 

(b) If V is a Carlson-Simpson subsequence of^, then 

(3.6) v(fc,/:) C w(fc,/:). 
Moreover, for every t G T(>C) we have 



(3.7) 



v{k, C, t) C w(fc, £, r). 
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3.2. Preliminary tools. We have the following lemma. 

Lemma 3.2. Let fc,n e N with k ^ 2 and n ^ I. Also let C be a finite alphabet 
which is disjoint from [k] andw be a Carlson-Simpson sequence over k of dimension 
n. Then for every m € [n] and every t G T{C) with \t\ = to we have 

(3.8) w(fc,/:,r) = {z(r) : z G Subseq,„(w)}. 

Proof. Write w — {w,wq, ...,w„-i) and fix to G [n] and a type r = (Aq, Am-i). 
By (|3.2p . we see that z(r) is of type r for every z G Subseq^(w). Conversely, let 

V = (ao, a^-i) G W(fc, £) such that w(u) G w{k,C,T). We need to find an to- 
dimensional Carlson-Simpson subsequence z of w such that z(t) — w(v). Observe 
that the words v and w(u) are of the same type, and therefore, v is of type r. For 
every i G {0, to — 1} we set 

(3.9) n, = min{j G {0, 1} : aj = Xi}. 

Notice that ^ no < ... < rim-i < £ ^ n. For every i G {0, ...,to — 1} we set a[ = ai 
if fli G [k]\ otherwise we set a[ = x. We define an m-dimensional Carlson-Simpson 
subsequence z — (z, zq, z„i-i) of w as follows. First we set 

(3.10) z = w^woia'^r ..rwno-i{a'„„^-^) 

with the convention that z = w if no = 0. Next for every i G {0, m — 1} let 

(3.11) z, = w„,ia'„j^..rwn,^,-iia'„^^^_^). 

It is easily checked that z is an m-dimensional Carlson-Simpson subsequence of w 
and satisfies z(t) = w{v). This shows that w(fc, £, r) C {z(t) : z G Subseq^(w)} 
and the proof is completed. □ 

Lemma [3.21 has the following consequence. It will enable us to reduce the proof 
of Theorem 11.31 to Theorem 12.31 

Corollary 3.3. Let k and C be as in Lemma \3.Sl Also let d be a positive integer 
and w = (w, wq, W2d-i) be a Carlson-Simpson sequence over k of dimension 2d. 
Then, setting w' = {w,Wo, ...,Wd^i), for every r G T{C) of length at most d we 
have 

(3.12) W{k,C,T) C {z(t) : z G Subseqrf(w)}. 

Proof. We start with two elementary observations. First notice that if z,z' are 
Carlson-Simpson sequences over k with z' an initial segment of z, then for every 

V G VF(fc, C) of length at most dim(z') we have 7.{v) = z'{v). Next observe that for 
every Carlson-Simpson subsequence z' of w' there exists a (not necessarily unique) 
Carlson-Simpson subsequence z of w with dim(z) — d and such that z' is initial 
segment of z. 



14 



PANDELIS DODOS, VASSILIS KANELLOPOULOS AND KONSTANTINOS TYROS 



Now fix a type r of length at most d and set m = \t\. By Lemma [3.21 and using 
the previous remarks, we see that 

(3.13) w'{k,C,T) = {z'(r) : z' G Subseq„(w)} C {z(r) : z e Subseqrf(w)} 

as desired. □ 
3.3. Proof of Theorem [mi We claim that 

(3.14) FK{k,q,d,r) CS(A:, 2d, d, X^') 

for every k,q,d,r G N with fc ^ 2 and q,d,r ^ 1. Indeed, let £ be an alphabet of 
cardinality q which is disjoint from [k]. Also let w be a Carlson-Simpson sequence 
over k with dim(w) ^ CS(fc, 2c?, d, ^ ) and fix an r-coloring c : w(fc, £) [r]. 
We set T ~ {t ^ T{C) : \t\ ^ d}. We define a coloring c : Subseq^(v^r) — > [j-]^ by 

(3.15) c(z) = (c(z(r)) : r e T) . 

By Fact 13.11 the coloring c is well-defined. Also notice that 

d 

(3.16) |r|<^g^<q'^+l. 

i=l 

Hence, by Theorem 12.31 there exists a Carlson-Simpson subsequence u of w of 
dimension 2c? such that the set Subseq£;(u) is monochromatic with respect to c. Let 
V be the unique Carlson-Simpson sequence over k of dimension d which is an initial 
segment of u. Notice, in particular, that v is a Carlson-Simpson subsequence of w. 
Fix a type r. If r ^ T, then by Fact 13.11 the set v(fc,£,T) is empty. Otherwise, by 
Corollary [23] and the choice of u, the set v(fc, £, r) is monochromatic. This shows 
that p. 141) is satisfied. 

Finally, by Corollary IB . 2 1 and taking into account this estimate, we see that the 
numbers FK(fc,g, c?,r) are bounded by a primitive recursive function belonging to 
the class £^ . The proof of Theorem 11.31 is thus completed. 

4. Flat sets 

4.1. Motivation. As we have mentioned in the introduction, there is no analogue 
of Ramsey's classical Theorem for colorings of pairs of [k]"^^ (or, more generally, of 
subsets of [fc]^^ of a given cardinality). Nevertheless, there is non-trivial informa- 
tion on this Ramsey-type problem. The relevant tools will be developed in this and 
the next two sections. Specifically, we shall define certain classes of finite subsets of 
[fc]*-^ with the following crucial properties. Firstly, each class is partition regular. 
Secondly, their union is sufficiently "dense" in the sense that for every nonempty 
finite subset F of [k] one can find an element of one of the classes that contains 
F and whose cardinality is effectively controlled by the cardinality of F. 

In this section we make the first step towards this goal. In particular, we define 
the family of flat sets which are the building blocks of the elements of the classes 
mentioned above. Their properties are discussed in §4.3 and §4.4. 
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4.2. Definition. Let fc 6 N with k ^ 2. A finite subset F of [k]<^^ wili be calied 
flat provided that (a) |F| > 2, and (b) there exists n e N such that F C [A;]". By 
Fl([fc]<^) we shall denote the set of all flat subsets of [k]<^\ If p e N with p > 2, 
then Flp([fc]<^) stands for the family of all flat sets of cardinality p. Moreover, for 
every Carlson-Simpson tree W of [/c]^^ by Fl(H^) we shall denote the set of all flat 
subsets of [k]'^^ which are contained in W. The set Flp(W^) is analogously deflned. 
Notice, in particular, that Flp{W) ^ if and only if 2 < p < fcdim(w) Therefore, 

J.dim(HO 

(4.1) Fl{W)= U Flp{W). 

p=2 

4.3. Word representation of flat sets and their type. Let k and p be a pair of 
integers with k,p^ 2. These parameters will be fixed throughout this subsection. 
We set 

(4.2) A([fcn = {(a^):ae[fc]} 

p— times 

and 

(4.3) £, = [fcr\A([fcn. 

As in §3.1, let W{k, Cp) be the set of all words over [k] U£p. We shall define a map 

(4.4) Rp:Flj,([fc]<N)^W(fc,£p) 

as follows. Let F G Flp([A;]^'^) be arbitrary and write the set F in lexicographical 
increasing order as {Iq <iex ••• <iex ip-i}- Also let n € N be such that F C [fc]" and 
notice that n ^ 1. For every i £ {0, ...,n — 1} and every j £ {0, ...,p — 1} let 
be the i-th coordinate of tj and set Ui = (oj,0) •••> aj,p-i)- Observe that Oj € [k]^ for 
every i G {0, n — 1}. We set 

if ai = (a,...,a)€A([fcf), 



(4.5) 



p— times 

ai if at e [k]P\A{[k]P). 



Finally we define 

(4.6) Rp(^^) = (ao,...,a«-i). 

We call the word Rp(-F) as the word representation of the flat set F. 

Before we proceed let us give a specific example. Let fc = 2 and p = 4 and 
consider the subset F of [2]^ consisting of the elements (1,2,2), (1,1,2), (2,1,2) 
and (2,2,2). We order F lexicographically as 

(4.7) F={{1, 1, 2) <iex (1, 2, 2) <iex (2, 1, 2) <iex (2, 2, 2)} 
and we observe that 

(4.8) So = (1, 1, 2, 2), ai = (1, 2, 1, 2) and as = (2, 2, 2, 2). 
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Thus, RiiF) is the word ((1, 1, 2, 2), (1, 2, 1, 2), 2) e W(2, £4). 

Now let F e Flp([fc]<'^). We define the type of F to be the type of its word 
representation Rp{F) in W{k,Cp). In particular, the type of F is an element of 
T{Cp). Notice that for every integer p' > 2 with p' p we have that £p' CiCp = 0, 
and so, T{Cp') D T{Cp) = 0. Therefore, if F and F' are flats sets of different 
cardinality, then their types are different. 

For every r 6 r(£p) by Flp,^([fc]<N) we shall denote the set of aU F £ Flp([fc]<'*) 
of type r. It is easy to see that if Flp.7.([fc]<^) is nonempty, then p ^ fcl'^L Notice, 
however, that not every r e T{Cp) with p ^ fcl"^' is realized as the type of a flat set 
of cardinality p. This is due to the fact that the type of a flat set F is determined 
after we have ordered F lexicographically. Taking into account these remarks, we 
set 

(4.9) %i[k,p] - {re T{Cp) : Flp^,([fc]<N) ^ 0}. 

Of course, we can relativize the above definitions to Carlson-Simpson trees. Specif- 
ically, for every Carlson-Simpson tree W of [fc]<^ and every r e T{Cp) by F\p_r{W) 
we shall denote the set of all F G Flp_T([fc]^'^) which are contained in W. Observe 
that Flp_i-(VF) is nonempty if and only if r G TFi[k,p] and |t| < dim(I^). 
We summarize, below, some basic properties of the map Rp. 

Fact 4.1. Let k,p G N with k,p ^ 2 and r G 7Fi[fc,p]. Then the following hold. 

(a) The map Rp : Flp([fc]^'*') — W{k,Cp) is an injection. 

(b) The restriction of Hp to Flp^^([fc]^'^) is ontoW{k,Cp,T). In particular, the 
map Rp : Flp^,-([fc]^^) ^{k, €-p,T) is a bijection. 

Fact 14. II is a straightforward consequence of the relevant definitions. We will also 
need the following elementary fact. 

Fact 4.2. Let G N with k,p 2. Also let V he a Carlson-Simpson tree of 
[k]"^^ and denote by v the Carlson-Simpson sequence that generates V. Finally let 
T G T{Cp) such that the set Flp^^(V) is nonempty. Then the following are satisfied. 

(a) Setting m = dim{V), we have that Flp,^(F) = ly (Flp,^([fc]<'"+i)) where 
ly is the canonical isomorphism associated to V . 

(b) The restriction of Hp to Flp(P^) is an injection into ■v(k,Cp). 

(c) The restriction of Hp to Fp.T-(t^) is onto ^{k, Cp^r). In particular, the map 
Rp : Flp.T-(T^) — > \'{k, Cp,T) is a bijection. 

4.4. Ramsey properties of flat sets. We have the following proposition. 

Proposition 4.3. Let k,d,r G N with k ^ 2 and d,r ^ 1. Then there exists 
a positive integer N with the following property. If n ^ N , then for every n- 
dimensional Carlson-Simpson tree W o/[fc]*-^ and every r -coloring of¥\{W) there 
exists a d-dimensional Carlson-Simpson subtree VofW such that every pair of flat 
subsets of V with the same type is monochromatic. 
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Proof. We set 

(4.10) g = ^(F-fc). 

p=2 

We will show that the desired positive integer N can be chosen to be the number 
FK{k,q,d,r). To this end let n > FK{k,q,d,r) be arbitrary. Also let W be 
an n-dimensional Carlson-Simpson tree of [k]^^'^ and denote by w its generating 
sequence. Fix a coloring c : Fl(VF) — > [r]. Let 

(4.11) C^\JCp. 

p=2 

By the definition of the finite alphabet Cp in (|4.3p and the choice of g in (|4.10p , we 
see that |£| — q. Next let be the subset of Fl(W^) defined by 

(4.12) e J" 4^ the length of the type of F is at most d. 

Notice that 2 < |i^| ^ k'^ for every F e F. We define K : F ^ w(fc, C) by the rule 
R(F) = R|^|(i^). It is easy to see that the map R is a well-defined injection. 
Finally let c : w(fc,£) [r] be defined by 



(4.13) c{w) 



c{F) if there exists F eF with R{F) 
r otherwise. 



Since dim(w) > FK(fc, g, d, r), by Theorem 11.31 there exists v G Subseq^(w) such 
that the set v(fc,£,r) is monochromatic with respect to c for every r G T{C) with 
t| ^ d. Let V be the Carlson-Simpson tree generated by v. By Fact 14. 2 [ we see 
that V is as desired. □ 

5. Basic sets 

In this section we continue the analysis outlined in §4.1. We start with the 
following definition. 

Definition 5.1. A basic set of [k]"^^ is a subset of [k]'^^'^ of the form 
(5.1) B^{s}u[j{s-F-..rF,) 

1=0 

where £ is a positive integer, s is a word over k and for every i G {0, — 1} the 
set Fi is a flat subset of [k] whose infimum AFi is the empty word. 

Notice that the sequence (s, Fq, F^-i) that generates a basic set B via formula 
(|5.ip is unique. It will be called the generating sequence of B. The word s will be 
called the top of B, while the positive integer £ will be called the dimension of B 
and will be denoted by dim(i3). 

Let i? be a basic set of [fc]<^ and {s, Fq, Fe^i) be its generating sequence. 
The level width sequence of B, denoted by p(B), is defined to be the sequence 
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{\Fq\, ...,\Fi^i\). The integer maxosji^f-i will be called the width of B and 
will be denoted by w{B). Finally, the type of B is defined to be the sequence 
(tq, ...,T£_i) where Ti is the type of the flat set Fi for every i e {0, ...,£— 1}. It will 
be denoted by t{B). Notice the following rigidity property of the type of B: if B' 
is another basic set with t{B') = t{B), then dim(i3') = dim(i3) and p{B') = p{B). 

By B([/c]^^) we shall denote the set of all basic sets of [fej^**^. The set of all basic 
sets of [fc]<^ which are contained in a Carlson-Simpson tree W of will be 

denoted by B{W). 

Now let £ be a positive integer and p = {po, ...,pi-i) be a finite sequence in N 
with Pi ^ 2 for every i G {0, ...,£— 1}. Also let t — {tq, ...,Tg_i) be such that 
Ti e T{Cpi) for every i £ {0, ...,£ — 1}, where is as in ()4.3p . We set 

(5.2) B,,p^t(W<^) = {B£ B([fc]<f*) : dim(S) = £,p{B) = p and t(S) = t}. 

For every Carlson-Simpson tree W of [A:]<^ the set B^^p^t(W^) is analogously defined. 
Observe that the set B^_p^t([^]^^) is nonempty if and only if n G 7Fi[fc,Pi] for 
every i e {0, — 1}, where 7Fi[fc,pi] is as in (|4.9p . Respectively, if is a 
Carlson-Simpson tree W of [A:]<^, then the set B^_p t(Vl^) is nonempty if and only 
if G 7Fi[fc,Pj] for every i e {0, 1} and I]j=o l"^*! ^ dim(M^). 

Much of our interest on basic sets stems from the fact that they possess strong 
structural properties. In particular, we have the following theorem which is the 
main result of this section. 

Theorem 5.2. Let k,d,r (zN with k ^ 2 and d, r ^ 1. Then there exists a positive 
integer N with the following property. If n ^ N , then for every n-dimensional 
Carlson-Simpson tree W of [k]"^^ and every r-coloring of }i{W) there exists a d- 
dimensional Carlson-Simpson subtree UofW such that every pair of basic subsets 
of U with the same type is monochromatic. The least integer N with this property 
will be denoted by RamB(A:, d, r). 

Moreover, there exists a primitive recursive function % : N'"* — > N belonging to 
the class £^ of Grzegorczyk's hierarchy such that 

(5.3) RamB(fc, d, r) ^ x(fc, d, r) 
for every k,d,r £ N with k ^ 2 and d,r ^ 1. 

We have already pointed out in Fact 12.41 that Carlson-Simpson trees preserve 
infima. Also notice that the type of a basic set does not depend on its top. Tak- 
ing into account these remarks, it is easy to verify that Proposition 14.31 follows 
from Theorem 15.21 In fact. Theorem 15.21 can be seen as the "higher-dimensional" 
extension of Proposition 14.31 

Proof of Theorem \5.S[ It is similar to the proof of Proposition 14.31 In particular, 
our strategy is to represent all basic subsets of a given Carlson-Simpson tree as 
words over an alphabet of the form [k] U C where C is an appropriately chosen 
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finite set. Once this is done, the result fohows by a straightforward apphcation of 
Theorem II .31 We proceed to the details. 

Fix k, d,r eN with fc ^ 2 and d,r ^ 1. We set 

(5.4) q = d-^{kP ~ k). 

p=2 

By Theorem ll.3[ it is enough to prove that 

(5.5) RaniB(fc, d, r) =^ FK(fc, q, d, r). 

To this end let be a Carlson-Simpson tree of [k]"^'^ with dim(VF) > FK(fc, g, d, r) 
and fix a coloring c : B(VF) [r\. Denote by w the generating sequence of W . 

For every i £ {0, ...,d~ 1} and every p e {2, fc'^} let — CpX {i}. Denote by 
ip : [k] U Cp ^ [k] U the natural bijection defined by ip(a) = a for every a £ [k] 
and ip(A) = (A, i) for every A S Cp. The map ip lifts to a bijection 

(5.6) ll:W{k,£p)^W{k,Cl) 

defined by Ij, ((so, ...,s„)) = (i;(so), -, ij,(s„)) • Also let : Flp([fc]<N) ^ W(fc, 
be the "representation" map defined in §4.3 and set 

(5.7) R;=i;oRp. 

By Factim the map R^ : Flp([fc]<'^) W(fc,£p is an injection. 
We define 

d-l k"^ 

(5.8) £ = U U £; 

1=0 p=2 

and we observe that 

d-l k'' k'' 

(5.9) |£|=^^|/:;|=d-^(F-fc)=<z. 

i=0 p=2 p=2 

Also notice that for every i 6 {0, d — 1} and every p g {2, fc'^} we have that 
W(fc,£p) C W{k,£). Therefore, the map R^ can be seen as an injection from 
Flp([fc]<N) into W{k,C). 

Now let B be the subset of B(W^) defined by 

e-i 

(5.10) B eB^iit{B) = (tq, r^-i), then ^ |r,| d. 

4=0 

Observe that for every B £ B the width w{B) of B is at most k'^. We define a map 

(5.11) R:i3^W(fc,/:) 
by the rule 

(5.12) R(B) = s-R^„(Fo)-Ri^(^^i)-...-R^;_\(F,_i) 
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where {s, Fq, Fe-i) is the generating sequence of B and pi = \Fi\ for every 

1 G {0,...,^— 1}. The following properties follow easily taking into account the 
relevant definitions. 

(PI) The map R is an injection into w(fc,£). 

(P2) Let £ G {l,...,d}. Also lot p = (po,---.W-i) and t = (tq, t^_i) where 
Pi e {2, fc"^} and n S T{CpJ for every i e {0, ...,£- 1}. Finally let V 
be a Carlson-Simpson subtree of W generated by some v G Subseqj;(w). 
Assume that the set B^^p^t(^) is nonempty and set 

(5.13) r = l°„(ro)-...-1^7_\(T,_i). 

Then t G T{C) and |r| = Yl\=li ^ dim(l/) = d. Moreover, we have 
B^,p,t(V^) ^ B and the map R : B^^p^t(^) — >■ 'v{k,C,T) is a bijection. 
Finally we define c : w(fc, £) — >■ [r] by 

{c(B) if there exists B with R(B) = w, 
r otherwise. 

Notice that, by (PI), the coloring c is well-defined. Since dim(w) ^ FK(fc, q, d, r), 
there exists v G Subseq^(w) such that the set v{k,C,T) is monochromatic with 

respect to c for every r G T(-C) with |t| ^ d. Let y be the d-dimensional Carlson- 
Simpson subtree of W generated by v. Using property (P2), it is easily verified 
that V is as desired. The proof is completed. □ 

6. Embedding subsets of into basic sets 

Our goal in this section is to complete the analysis outlined in §4.1. In particular, 
we shall prove the following embedding result. 

Proposition 6.1. Let fc G N with k ^ 2. Also let W be a Carlson-Simpson tree of 
[fc]"^^ and G QW with \G\ ^ 2. Then there exists a basic set B of dimension and 
width at most \G\, and such that G ^ B . 

Proof. We may assume that W is of the form for some integer ^ 1. 

Observe that if G is flat, then we can write G as s^F where s = AG and F is a 
fiat set whose infimum is the empty word. In this case, the set B = {s} U {s^F) is 
the desired basic set. Therefore, in what follows, we may additionally assume that 
G is not fiat. 

First we set i = {n G {0,...,d} : G n [k]" ^ 0} and £ = \L\. Notice that 

2 < £ < We write the set L in increasing order as {ni < ... < n^}. For every 
i G {0,...,^- 1} let 



(6.1) 



Gi = Gr\ [kp+' 
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and 
(6.2) 



where, as in p.4p . Gj|ni+i = {wjni+i : w e Gj}. Observe that 
(6.3) |G.| sc: <C^|G.| 



for every i G {0, 1}. Hence, 

(PI) \Ho\ s=: \G\ and 

(P2) max{\Hi\ : i e {1,...,^- 1}} < \G\. 

Let i £ {1, — 1} be arbitrary. For every w G [fc]"*+i denote by w* the unique 
element of such that w = {w\Tii)'^w* . We set 



(6.4) 

Notice that 
(6.5) 
and 
(6.6) 



(P2) 

m^m < \G\ 



Gi Q Hi Hq hi ...'~'H*. 



Next we claim that for every i G {!,...,£— 1} there exists a flat set Fi such that 

(a) the infimum AFi of Fi is the empty word, 

(b) H* C F, and 

(c) ^ \h:\ + i. 

Indeed, fixzG{l,...,^ — 1}. If AH* is the empty word, then we set Ft — H* . 
Otherwise, the set H*\l is a singleton, and so, we may select a G [k] with a ^ H*\\. 
Let a be the unique word of length n.i+i — Ui all of whose coordinates are equal to 
a. We set Fi = H* U {a}. Clearly Fi is as desired. 

We are ready for the final step of the argument. Let s = AHq- Assume, first, 
that {s} = Hq. In this case we define 



(6.7) 



B ^{s}U (s^Fi) U ... U {s^F['..rFi^i). 



It is easy to check that B is the desired basic set (in fact, in this case, the dimension 
of B is at most \G\ — 1). Otherwise, write Hq as s^Fq where Fq is a flat set 
whose infimum is the empty word. We define B to be the basic set generated 
by the sequence (s, Fq, Fi, Ff_i). It is also easily verified that B satisfies all 
requirements. The proof is completed. □ 
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7. Correlation on basic sets: one-dimensional case 

7.1. The main result. We start by introducing some numerical invariants. Specif- 
ically for every k,p €N with k,p^ 2 and every < e ^ 1 let 

(7.1) qp = qp{k) ^kP-k, 

(7.2) Ap = Ap(fc,e) = A(A:9^eV4) ^16e-2DCS(fc«^ 1, £^16)1 
and 

Moreover, if m is a positive integer, then we set 

(7.4) CorJ(fc,m,e) = CS(A:, m ■ A^™ (fc, e), 1, 2'=) . 
We are ready to state the main result of this section. 

Proposition 7.1. Let /c, m G N with k ^ 2 and ni ^ 1, and < e ^ 1. Also let W 
be a Carlson-Simpson tree of [k]'^^ of dimension at least Cor*(fc,m,£). Then for 
every family {A„, : w G W} of measurable events in a probability space (51, 1],/i) 
satisfying /x(Au,) ^ e for every w G W, every p G {2, fc™} and every t G 7Fi[fc,p] 
with \t\ ^ m there exists an one- dimensional basic set B QW of width p and type 
T such that 

(7.5) fl A„) ^77p(/c,e). 

We emphasize that the lower bound in (17. 5p is independent of the length of the 
type T. Also we remark that Proposition 17. II is the first (and crucial) step towards 
the proof of Theorem 11.21 The proof of Proposition 17.11 will be given in §7.2. In 
§7.3 we isolate some of its consequences. 

7.2. Proof of Proposition [TTTJ First we introduce some terminology. For every 
pair v,v' G [fc]^^ we say that v' is a successor of w if u is a proper initial segment 
of v' . Notice, in particular, that if v' is a successor of w, then \v\ < \v'\. Moreover, 
it is easy to see that for every Carlson-Simpson tree V of generated by the 
sequence v and every v,v' G V we have that v' is a successor of v if and only if 
there exist {u,uo) G Subseq2(v) and a G [k] such that v = u and v' = u'~^uo{a). We 
have the following lemma. 

Lemma 7.2. Let k e N with k ^ 2 and < -d < e ^ 1. Also let N e N with 
N ^ (e^ — i9^)~^. Finally let W be a Carlson-Simpson tree W and assume that 

(7.6) dim{W) ^ CS(fc, N, 1,2*^). 

Then for every family {A^ : w G W} of measurable events in a probability space 
(ri, S],/i) satisfying fj,(Au]) ^ £ for every w G there exists an N -dimensional 
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Carlson-Simpson subtree V of W such that for every pair v,v' £ V with v' a 
successor of v we have 

(7.7) niA, n A,,) ;? d\ 

We notice that Lemma [7.21 is similar to [TJ Lemma 7.3]. Also we point out that 
in Proposition 1 10 . 1] we shall obtain a significant extension of this result. However, 
the argument below can serve as an introduction to the proof of Proposition 110. ![ 
and as such, we decided to present it for the benefit of the reader. 

Proof of Lemma \ 7. S\ Let w be the generating sequence of W. For every a S [k] set 

(7.8) Ta = {iu,ua) e Subseqi(w) : ^(A„ n A„„ „„(„,) ^ -d^}. 

By Theorem 12.31 and the estimate in (|7.6|) . there exists v G SubscqjY(w) such that 
for every a £ [k] either Subseqi(v) C Ta or Subseqi(v) nJa = 0- Therefore, it is 
enough to show that Subseqj^ (v) n J-"q 7^ for every a e [k]. To this end fix a G [k] 
and write v as {v,vq, vpf^ 1 ) . For every i € { 1 , . . . , N} we set 

(7.9) ti = v'~'vo{ay ...'~^Vi-i{a). 

Since N ^ {e"^ — -d^)'^, by Lemma [2.61 there exist i, j £ [N] with i < j and such 
that fi{At, n At^) ^1}^. We set 

(7.10) u — v"^vo{a)"^ ..."^Vi^i{a) and uq = ...^Vj^i. 

Observe that ti — u and tj — u'"uo{a). Hence (w, uq) G Subseq]^(v) n J-a and the 
proof is completed. □ 

Notice that for every pair q, n of positive integers the sets ([fc]'')" and ([fc]")'' can 
be naturally identified. Therefore, we may view a word over [fc]*^ as a sequence of 
length q having values in [fc]" for some n G N. This is, essentially, the content of 
the following definition. 

Definition 7.3. Let k,q £ N with k,q ^ 2. For every positive integer n we define 
the map 

(7.11) : mT m-r 

as follows. Fix t ~ (ao, a„_i) G ([fc]"^)" and for every j G {0, n — 1} and every 
i G {0, 1} let ai_j be the i-th coordinate ofsLj. Next for every i G {0, ...,g — 1} 

letti — (fli^o, ai,„_i) and define 

(7.12) V(t) = (to,...,Vi). 

Also let 

00 

(7.13) l,:{m<^^{0}u\ji[krr 

n=l 

be defined by lq{0) — and 

(7.14) I,(t)=I,,|,|(i) 
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for every t £ ([A:]«)<^ with \t\ ^ 1. 

We will need the following elementary fact. 

Fact 7.4. Let k,q with k,q^ 2. Also let s be a variable word over [k]''. Then 
there exists a unique sequence (sq, Sq_i) such that Si is a variable word over k 
with \si \ — \s\ for every i G {0, g — 1} and satisfying 

(7.15) Ig(s(a)) = (so(ao),---,Sg-i(ag-i)) 

for every a = (ao, ...,aq_i) G [fc]*. Moreover, if s is a left variable word, then so is 
Si for every i G {0, g — 1}. 

We are about to introduce one more definition. To motivate the reader, let us 
first notice that every word over [fc]* of length n can be represented as a word over k 
of length qn, just by concatenating its coordinates. We will need, however, a rather 
different representation. Its main properties are described in Lemma 17.71 below. 
Specifically, the representation is designed so that the image of every Carlson- 
Simpson line of ([fc]'')^^ contains a basic set of [k]'^^ of a given type r. 

Definition 7.5. Let fc,p G N with k,p ^ 2. Define qp and Cp as in (|7.ip and (14. 3p 

respectively. Notice that \Cp\ = qp. Write the set Cp in lexicographical increasing 
order as 

(7.16) Cp = {Ao <icx ••• <icx A<jp-i}. 

Let T G T{Cp) and set t = |t|. Consider the unique sequence (iq, i^-i) in 
{0, qp — 1} such that 

(7.17) r = (A.„,...,A.,_J. 

We define two maps 

(7.18) : ([fc]«-)<'' ^ [fc]<^ and <i>l^ : ([fc]9p)<N ^ [fc]<N 

as follows. First set ^p^ri^) = ^p,t{^) = ^- Otherwise, let t G ([fc]'?p)<^ with 
\t\ ^ 1 and write Igp(i) as (<o, ^ijp-i)- We set 

(7.19) <^pAt) = ..^{\_,) and $o_,(i)=i,„. 

We isolate, for future use, some properties of the maps ^p.r and All of 

them are straightforward consequences of the relevant definition. 

Fact 7.6. Let k,peN with fc,p > 2 and t G T{Cp). 

(a) For every t G ([fc]'p)<^ the word <^^.^{t) is an initial segment of^p,r{t). 
Moreover, if \t\ ^ 2 and \t\ ^ 1, then $p_T-(i) is a successor of ^(t). 

(b) For every t G ([fc]9p)<N we have |$p,r(t)| = \t\ ■ \t\. 

(c) For every t,s G ([/c]«f)<'^ with \t\,\s\ ^ 1 if ^p,r{t) = (t,^y ...^ {ti,_^) and 
*p,r(s) = (siJ'"...'^(sjj_J, then 
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We proceed with the following lemma. 

Lemma 7.7. Let k,p G N with k,p ^ 2 and r G TF\[k,p]. Also let qp be as in (|7.ip . 

(a) Let s be a variable word over [k]'^p and set n — \s\. Then the set 

(7.21) {<fp,,(s(a)) :ae 

contains a flat set F C of cardinality p and type r. 

(b) Let t be a word over [k]'^p and s be a left variable word over [/c]'p and set 
n — \t\ + \s\. Then the set 

(7.22) iKAt)} U {$p,r(rs(a)) : a e [k]""} 

contains a basic set B C [fc]^" !'^''^^ of dimension 1, width p and type r. 
Moreover, the top of B is the word $p ^ [t) . 

Proof. First we need to do some preparatory work. As in Definition 17. 5[ write 
the set Cp in lexicographical increasing order as {Ao <icx •■• <icx ^qp-i\- Also 
let £ = |r| and (iq, if-i) be the unique sequence in {0, ...,gp — 1} such that 
T = (Aiji , Ai^^j ). For every i G {0,...,qp — 1} and every m G {0,...,p— 1} let 
li,m G [k] be the m-th coordinate of A^. We set 

(7.23) b™ = (/o,™,...,^,,-i,m) e [kY- 
for every m G {0, — 1}. 

(a) Let 

(7.24) F = {$p,r(s(b,„)) : TO < p - 1}. 

We claim that F is as desired. Indeed, by Fact 17.61 we have that F C [fc]" ''^'. Hence 
F is flat. It is easily seen that the cardinality of F is p. Therefore, it is enough to 
show that F is of type r. To this end let (so, ■■■i Sqp-i) be the sequence of variable 
words over k obtained by Fact 17.41 for the variable word s. By (|7.15p and (|7.19p . 
we see that 

(7.25) $p,T(s(bm)) = Si^{li^^rn)^ ..rSi^_^{li^_^^ra) 

for every m G {0,...,p— 1}. Noticing that A^^ — {lij.m '■ m G {0, 1}) for every 

j G {0, 1}, by (|7.25p . we conclude that F has type (Ai„, Ai^_j) = r. 

(b) We will give the proof under the additional assumption that |<| ^ 1. If t is the 
empty word, then the proof is similar (in fact, it is simpler). First let 

(7.26) I,^(i) = (io,...,i,,-i) 

and (sq, Sgp_i) be the sequence of variable words over k obtained by Fact 17.41 
for the left variable word s. Notice, in particular, that Si is a left variable word for 
every i G {0, qp — 1}. Also observe that <^p .^{t) = ti^. Moreover, by Fact 17.61 and 
(fr26l) . we have 

(7.27) $p,^(i'"s(b„)) = (iio)'"Sio('io,™)'"---'"(**£-l)'"Sii-l(^*£-l,m) 
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for every m G {0, £ — 1}. We set 

(7.28) Fo = {s,,{U„^^Y(i^X -^^K^X : ^ m ^ p - l}. 

Arguing as in the first part of the proof, we see that Fq is a flat set which is contained 
in [fc]" !"^!"!*! and is of cardinality p and type r. Since (sqj Sgp-i) consists of left 
variable words, it is easy to verify that AFq is the empty word. Therefore, setting 

(7.29) B = {<,JU((i,J-Fo), 

we see that B is as desired. The proof is thus completed. □ 

The following lemma is the final step towards the proof of Proposition 17.11 

Lemma 7.8. Let k,p,£ with k,p^ 2 and £ ^ 1. Also let < e ^ 1 and define 
Ap — Ap{k,e) as in (j7.2l) . Finally let W be a Carlson-Simpson tree of [fc]^'^ and 
assume that 

(7.30) dim(Ty) ^ CS(fc,£- Ap,l,2'=). 

Then for every family {A^ : w G W} of measurable events in a probability space 
(f2, S,/i) satisfying p{Aw) ^ s for every w CzW and every t G TF\[k,p] with \t\ — £ 
there exists an one- dimensional basic set B C W of width p and type t such that 

(7.31) Aj)-^i^p{k,e) 
where rip{k,e) is as in (17. 3p . 

Proof. Notice first that, by Lemma applied for "z? = e/2" and "d = £ • Ap", 
there exists a Carlson-Simpson subtree y of 14^ with dim(y) = £ ■ Kp such that 

(7.32) p{A,nA,,)^e^/A 

for every v^v' G V with v' successor of v. 

Next observe that there is a natural bijection between the alphabets [k]'''' and 
[fc^p]. This bijection extends, of course, to a bijection between ([fc]'p)^^ and [fc'^pj^^. 
Keeping in mind these remarks, in what follows we will identify words (respectively, 
left variable words) over [fcj'^p with words (respectively, left variable words) over 
[fc'^p]. We will also identify V with via the canonical isomorphism ly. 

Fix T G 7Fi[fc,p] with |r| = £. Let qp be as in ([7T|) . For every t G ([A;]«p)<'^p set 

(7.33) it =^$o ^(t) n^$^^^(t) 

where ^p^r and <I>p ^ are as in Definition 17.51 By Fact 17. 6[ we have 

(7.34) $°^^(([/c]«'')<'^'') C [fc]<^-Af+i and $p,r(([fc]''')<^'') C [/c]<'^-'^''+i. 

Hence At is well-defined. Invoking Fact 17.61 once again and the estimate in (j7.32p . 
we see that /i(it) ^ for every t G ([/cj'^f )<'^p. By the choice of Ap in (|7.2p and 
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Lemma [2.81 applied for "fc — k'^p" and "(5 — there exist a word t over [k]'^p 

and a left variable word to over [fcj^p such that, setting 

(7.35) S ^{t}Ll{rto{a) ■.aeik]""}, 
we have 

(7.36) fl it) 77(fc«^£V4) ^ ^7p(fc,e). 

tes 

Notice that 

(7.37) 0^* = ^<E>n mnA,!, n n ^T.n / / , n 

t£5 ae[fc]«p 

c n n A / , , ,V 

ae[fc]9p ^ ' 

By Lemma 17.71 there exists an one-dimensional basic set B of width p and type r 
such that 

(7.38) B C {$;!,,(t)} U {$p,,(rto(a)) : a G [fc]^-}. 
By (fr36| . ([7:37)) and ([7381 . we conclude that 

(7.39) fl A^) ^ryp(fc,e) 

weB 

and the proof is completed. □ 

We are ready to complete to the proof of Proposition 17.11 

Proof of Proposition \ 7. 1] Let {^4^, : w G W} be a family of measurable events 
in a probability space (fi, E,^) satisfying ^(A^) ^ e for every w £ W. Also 
let p G {2,...,fc'"} and t G 7Fi[fc,p] with \t\ = £ ^ m. By ([7^ . we see that 
Ap{k,e) ^ Afcm(fc,e) and so 

(ED 

(7.40) dim(PF) > Cort(fc,m,£) ^ CS(fc, £ • Ap, 1, 2*=) . 



Thus, invoking Lemma [7. 8[ the result follows. □ 

7.3. Consequences. We will need the following consequence of Proposition [73] 

Corollary 7.9. Let fc G N with fc ^ 2 and < g ^ 1. Also let m,£ G N with 
1 ^ £ < m and V a Carlson-Simpson tree of [fc]^'^ with 

(7.41) dim(F) ^ Cor^J (fc, to, g) + to. 

Finally let p = (po, a'f^ t = (tq, T£_i) where pi E N with pi ^ 2 and 

Ti G Tpiik, Pi] for every i G {0, ...,£ — 1}, and assume that Yll=a ^ ™- -Assume, 
moreover, that {Ay : v G V^} is a family of measurable events in a probability space 
(ri, S,/i) such that for every B G ^e.p.tiV) we have 



(7.42) fl A,) ;^ g. 



v&B 
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Then for every p £ {2,...,fc™} and every t G 7Fi[fc,p] with \t\ ^ m there exists 
an {£ + 1) -dimensional basic set B' QV with p(-B') = {p,Poj ■■■,Pe-i) and t[B') — 
(r, To, ...,T£_i), and such that 

(7.43) fl A,) ^Vpik,g) 

veB' 



where r]p{k, g) is as in (|7.3p . 

Proof. Set d = dim{V) and identify V with [fc]<''+^ via the canonical isomorphism 
ly. Also fix a basic set Bi of [fc]<™+^ with top the empty word, dim(i?i) — I, 
p(Bi) = p and t{Bi) = t. We set 

(7.44) do = Cor*(A:,m, 
and for every s G [k]^'^°^^ let 

(7.45) B, = s^Bi and A, ^ Q A^. 

Let s £ [fc]'^''''+^ be arbitrary. Since d ^ do + m we see that Bs is a basic set of 
[fc]^''"'"^ with top the word s, dim(i3s) = £, p(i?s) = P and t{Bs) = t. In particular, 
we have that Bg £ ^e,p.ti[k]'^'''~^^)- Hence, by om^ assumptions, we conclude that 

(7.46) fi{A,) ^ g. 

Now let p e {2,...,fc'"} and t G T/[A:,p] with |t| ^ m. By (fTIi]) and (fT^ie]) . we 
may apply Proposition 17.11 for the family {As : s G [/c]^'^"'^^}, the integer p and 
the type r. Therefore, there exists an one-dimensional basic set Bq C [fc]<'^''+^ of 
width p and type r, and such that 

(7.47) m( n ^^p(fc'^?)- 

seBo 

Let (soj^o) be the generating sequence of Bq and define 

(7.48) B' = {so}U{sqFo)U{s-^F-Bi). 

Since the top of Bi is the empty word, we see that B' is an {£ + l)-dimensional 
basic set of with p{B') — {p,po, ...,pg-i) and t{B') — (r, To, T£_i). Also 

notice that B' C BqBi. Therefore, 

(7.49) f]As^ f] A,. C f]A. 

and so 

(7.50) m( n n ^0 

veB' seBo 

The proof is completed. □ 
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8. Correlation on basic sets: higher-dimensional case 

Our goal in this section is to obtain a higher-dimensional extension of Proposition 
17.11 This extension is the final step of the proof of Theorem 11.21 To state it we 
need to introduce some numerical invariants. 

Let fc G N with fc ^ 2 and < £ ^ 1. Also let p — {po, ...,pi-i) be a nonempty 
finite sequence in N with pi ^ 2 for every i G {0, — 1}. We define a sequence 
{Vi)iZo of positive reals recursively by the rule 

where rjp^_-^{k,e) and rip^_._^(k,rii) are as in (|7.3p . We set 

(8.2) r,p(fc,e) =r,,_i. 
Also let m e N with to ^ 1 and define 

(8.3) m(fc) = ( fc",. . .,fc" ) 

m— times 

and 

(8.4) Cor(fc, TO, e) = Rame ^/c, Cor^ (fc, to, ?7rn(fc) (^i £)) + ™i 2^ 

where Cor^ (fc, m, r7m(;j) (fc, e)) is as in (j7.4p . We have the following proposition. 

Proposition 8.1. Let k,m ^ N with k ^ 2 and to ^ 1, and < e ^ 1. Also let 
W be a Carlson-Simpson tree of [fc]^'*' of dimension at least Cor(fc, m, e). Then for 
every family {A^ '■ w G W} of measurable events in a probability space (17, 
satisfying /i(A^) ^ e for every w ^ W there exists a Carlson-Simpson subtree U of 
W with dim(J7) — m such that for every basic set B of U we have 

(8.5) n A^) ^ Vp{B){k,e). 

weB 

Proof. Wc fix a Carlson-Simpson tree W of [k]"^^ of dimension at least Cor(fc, to, e) 
and a family {Ay^ : w G W} of measurable events in a probability space (il, 
satisfying ^(A„) e for every w ^W. We set 

(8.6) g = {Be B{W) : fj A„) 77p(s)(fc, e)}. 

By Theorem 15.21 and ()8.4p , there exists a Carlson-Simpson subtree of with 

(8.7) dim(T^) = CorJ (fc, m, //^(fc) (fc, £)) + m 

and satisfying the following property. For every pair Bi , B2 of basic sets of V with 
the same type we have that Bi ^ Q \i and only if B2 G Q. 

Claim 8.2. Let £ G [m]. Also let p = (j>o, ...,pi-i) and t — (tq, t^_i) where 
Pi G N with Pi ^ 2 and Ti G TFi[k,Pi\ for every i G {0, — 1}. Assume that 
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Notice that, by Claim [821 any m-dimensional Carlson-Simpson subtree U of V 
satisfies the requirements of the proposition. Therefore, the proof will be completed 
once we prove Claim 

To this end we will proceed by induction on £. The case = 1" follows imme- 
diately form Proposition 17.11 since 

(8.8) dim{V) ^ CoTl{k,m,r]^^k){k, e)) ^ Cort(fc,m,£). 

So let ^ e [m — 1] and assume that the claim has been proved up to £. We fix 
two sequences p = {po,...,pi) and t — {to,...,T£) where € N with pi ^ 2 and 
Ti S TFi[k,Pi] for every i e {0, ...,£}, and such that X]i=o I'''*! ^ 

(8.9) p' ^ {pi,...,pi) and t' = (n, r^). 

By our inductive assumptions, we see that B^ p/ t'(V^) C Q. This is equivalent to 
say that 

(8.10) /i( fl A^) ^Vp'{k,e) 

for every B' e Bi^p>^t'{V). Observe that ri^(k){k,s) ^ r]p>{k,e). Hence, by (|8.7p . 

(8.11) dim(F) ^ Cori (fc, m, rjp, (fc, e)) + to. 

By (|8.10l) and (I8.1ip . we may apply Corollary [TH] for ''g — ?7p/(fc,e)", "p = po" and 
"r = tq". In particular, there exists a basic set BofV with p(i3) = (potPi, ■■■■,Pi) 
and t(_B) = (to, ti, r^), and such that 

(8.12) Pi aJ^ ^ r]p„{k,rjp'{k,e)) ''^ J?p(fe, e) = 77p(B) (fc, e). 

It follows that B e B£+i,p,t{V) fl ^7 and so B£+i^p^t(V) C ^ by the choice of V. 
This completes the proof of Claim [S?^ and, as we have already indicated, the entire 
proof is completed. □ 



9. Proof of Theorem 11.21 
Let fc e N with fc ^ 2 and < e ^ 1. We set 

(9.1) 6'(fc,e, l)=e. 
If n e N with n ^ 2, then let 

(9.2) n = (rwri) 

n — times 

and define 

(9.3) e{k,e,n)^rjn{k,e) 

where rin{k,e) is as in (18. 2p . Finally, for every positive integer to let Cor(fc,TO,e) 
be as in (18.41). We claim that with these choices the result follows. 
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Indeed, fix a Carlson-Simpson tree T of [k]"^^* of dimension at least Cor(fc, m, e) 
and a family {At : t G T} of measurable events in a probability space (fi, 
satisfying ^{At) ^ e for every t ^T. By Proposition 18.11 there exists a Carlson- 
Simpson subtree S* of T of dimension m such that 

(9.4) m( fl At) ^7yp(B)(fc,e) 

tea 

for every basic set B of 5. We will show that S is as desired. To this end let F 
be a nonempty subset of S and set n = If n = 1, then the estimate in ()1.3|) 
is automatically satisfied by the choice of 9{k,e, 1) in (|9.1I) and our assumptions. 
Otherwise, by Proposition 16.11 there exists a basic set B of width and dimension 
at most n and such that F <Z B C S. In particular, if p{B) = (po, ■■■,Pi-i), then 
^ S [ri] and pi G {2, ...,n} for every i G {0, ...,£ — 1}. This is easily seen to imply 
that ?7p(B)(fc,£) ^ Tjn{k,£)- Therefore, 

(9.5) m( n ^ n *9 ^P(s)(fc,£) ^ '7n(fc,£) ^ 9{k,e, \F\). 

t£F teB 

The proof of Theorem 11.21 is thus completed. 

10. Estimating the constant 0{k,e,2) 
This section is devoted to the proof of the following result. 

Proposition 10.1. Let fc, m G N with fc ^ 2 and m ^ 1, and < -d < e ^ 1. 

Then there exists a positive integer Cor2(fc, m, i?, e) with the following property. If 
W is a Carlson-Simpson tree of [k]'^^ of dimension at least Cor2(fc, m, e) and 
{A^ : w G W} is a family of measurable events in a probability space (51, S],/i) 
satisfying /^(Au,) ^ e for every w G W , then there exists a Carls on- Simp son subtree 
UofW with dim(J7) — m and such that for every u,u' we have 

(10.1) ^l{AuC^Au')^'^^. 

Proposition 110.11 implies, of course, that the constant 6{k,e,2) can be chosen to 
be — o(l). Its proof is given in §10.2 and is based on a detailed analysis of the 
Ramsey properties of pairs of Carlson-Simpson trees. This analysis is carried out 
in §10.1. 

We mention that, in what follows, for any set X by Pairs(X) we shall denote the 
set {F X : \F\ — 2}. While unconventional, this notation is quite informative 
and very convenient. 

10.1. Word representation of pairs of [fc]^'*'. Let k <E N with fc ^ 2 and, as in 
(1131), define 

(10.2) C2^[kf\{{a,a) -.aeik]}. 
Next fix a letter * and let 

(10.3) £ £2 U {(a,*) : a G [fc]} U {(*,a) : a G [k]}. 
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Setting A = [fc] U {*}, we see that L C A? . Also notice that |£| — P + k. As in 
§3.1, by W(fc,£) we denote the set of all words over [k] U £. 
We shall define an injective map 

(10.4) R : Pairs([A;]<'*^) W(fc, £) 

as follows. Let F G Pairs([A;]^'^) be arbitrary. We consider the following cases. 

Case 1: F is flat. In this case let R(F) = R2(F) where R2 : Fl2([A:]<f*) ^ W(A;, C) 
is the "representation" map defined in §4.3. 

Case 2: F = {w,u} with u a successor of w. Write u as w'"(ao, ctn-i) where 
Ui G [k] for every i G {0, n — 1}. We define 

(10.5) R{F)=w^{ao,...,an-i) 
where 



(10.6) 



for every i e {0, n — 1}. 



(*, ao) if tti = ao, 
Oj otherwise 



Case 3: F = {w,u} with w <iex u and \w\ < \u\. Let m = \w\ and n = \u\ and 
notice that m < n. Write w as (oq, ...,am-i) and u as (60, •••,&n-i), and define 

(10.7) R{F) = ((ao, &o), {am-i,bjn-i)y(bm, bn-i) 

where 



(10.8) bj 



{*,bm) if bj = bm, 
bj otherwise 



for every j G {m, n — 1}. 

Case A: F = {w^u} with w <iox u and \w\ > \u\. As in the previous case, let 
m = \w\ and n = Also write w as (ao, am_i) and u as (60, 6„_i). We 
define 

(10.9) R(F) = ((ao, 60), (a„_i, 6„_i))'"(a„, am-i) 

where 



(10.10) aj = 



for every j G {n, m — 1}. 



(a„,*) if aj = a„, 
i otherwise 



The above cases arc exhaustive, and so, this completes the definition of the map 
R which is easily seen to be injective. For every F G Pairs([A:]^^) we call the word 
R(F) as the "representation" of F. We define the type of F to be the type of its 
representation R(-F) in W(A;, £) (see §4.3). 



MEASURABLE EVENTS INDEXED BY WORDS 



33 



We remark that not all types in T{C) are realized as the type of a pair of [fc]^'*'. 
Actually, the type r of a pair F has a very particular form depending, of course, 
on the nature of F . Specifically, we have the following possibilities. 

(1) In Case 1 the type r is of the form ((ao, 6o), (a^-i, where £ is a 
positive integer and oq, &o, a^-ij ^f-i £ [^1- 

(2) In Case 2 the type r is of the form ((*,a)) where a e [k]. In particular, in 
this case there are exactly k many types. All of them have length 1. 

(3) In Case 3 the type r is of the form ((ao, 6o), (a^-i, (*, &) where I 
is a positive integer and ao, &oi •••i ^ G [^1- 

(4) In Case 4 the type r is of the form ((ap, 6o), (a^-i, 6£-i)) (a, *) where £ 
is a positive integer and ao, &o, •■•7 ^f-i: a G [k]- 

We proceed to discuss two important properties guaranteed by the above defi- 
nitions. They are isolated in Lemmas 110.31 and 110.41 below. However, first we need 
to introduce some pieces of notation. Specifically, for every Carlson-Simpson tree 
W of [fc]^'^ and every r 6 T{C) we set 

(10.11) Pairs^(I^) = {F e Pairs(M^) : is of type r}. 
Moreover, if a G [fc] U £ and i is a positive integer, then let 

(10.12) a* = (a, ...,a). 

times 

By convention, aP is defined to be the empty word. 
The following fact is straightforward. 

Fact 10.2. LetV he a Carlson-Simpson tree o/[fc]^'*' and denote by v its generating 
sequence. Then the following hold. 

(i) The restriction 0/ R to Pairs(V^) is an injection into ■v{k,C). 

(ii) Let T G T{C) and assume that the set Pairs7-(V^) is nonempty. Then the 
restriction of K to PairS7-(T^) is onto \'{k, C^t). In particular, the map 
R : Pairs7-(V^) ^> v(fc, £, r) is a bijection. 

We are ready to state the first main result of this subsection. 

Lemma 10.3. Let k,d,r,n be positive integers with k ^ 2. Also let W be a 
Carlson-Simpson tree of [k]"^^ and assume that 

(10.13) dim(W^) ^ FK(fc,fc2 + fc,d,r). 

Then for every r-coloring of the set Pairs(VF) there exists a Carlson-Simpson sub- 
tree V of W with <:\mi{V) = d such that for every type t G T{C) the set Pairs^(V^) 
is either empty or monochromatic. 

Proof. Fix a coloring c : Pair(T/F) — [r] and define c : w(fc, C) [r] by the rule 

fc{F) if there exists F G Pairs(W^) with R{F) = w, 
r otherwise. 
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By Fact ll0.2l the coloring c is well-defined. Next recall that |£| — k'^ + k. Therefore, 
by (|10.13|) and Theorem [131 there exists a d-dimensional Carlson-Simpson subse- 
quence V of w such that for every t G T{C) the set v(fc, £, r) is monochromatic. 
Let V be the Carlson-Simpson subtree of W generated by v. Invoking Fact 110.21 
we see that V is as desired. □ 

The final result of this subsection is the following lemma. 

Lemma 10.4. Let k,m £ N with fc ^ 2 and m eN, and < ?? < e ^ 1. Also let 
V be a Carlson-Simpson tree of [fc]^'*' with 

(10.15) dim(F) ^ \{e^ - z?^)"^] ■ (m + 1). 

Finally let {Ay : v G V} be a family of measurable events in a probability space 
(ri, S],/i) satisfying ^ e for every v d V . If t d T{C) is a type such that 

PairST-(y) ^ and \t\ ^ m, then there exists {v,v'} G PairS7.(t/) such that 

(10.16) fi{Ay n Ay') . 

Proof. We set d = dim{V). Clearly we may assume that V — [k]^'^^-^. Also let 

(10.17) no^\{s^~-^^r^^. 

Fix a type r G T{C) with |t| ^ m and such that PairsT-([fc]<'^+^) ^ 0. We need to 
find {v,v'} G PairsT-([fc]^''+^) such that the estimate in (|10.16p is satisfied for the 
pair {v,v'}. The argument below is not uniform and depends on the form of the 
type r. In particular, we distinguish the following cases. 

Case 1: we have that r = ((oq, 6o), (a^^i, where \t\ — £ ^ [m] and 

ao,bo, ...,ai-i,bi^i G [k]. For every i G {0, ...,no} define 

(10.18) V, = ibl,ria^^"-Tib\riaT-T--ribUriaZT)- 

Notice that Vi G [k]""'^. Hence, by (|10.15l) . we have Vi G [fc]^''"''^ which implies 
that fi{Ay.) > e. By (|10.17p and Lemma \2M there exist i,j G {0, ...,no} with 
i < j such that /i(At,. n Ay-) ^ •d'^. We set F = {vt, vj}. It is enough to show that 
F G Pairs^([fc]<''+i). Indeed, by (|10.18l) . we see that R{F) is the word 

(10.19) ibl,r{iao,boy-Ti<°~'r■■ribur{ia,.^M-ly~Ti^^^^^^^^^ 

This easily implies that the type of F is r. Since F C [k]^'^^-^ we conclude that 
F G Pairs^([fc]<''+i). 

Case 2: we have that r = ((*,a)) where a G [k]. For every i G {0,...,no} let 
Vi = a* and notice that Vi G [ky C In particular, we have fJ,{Ay.) ^ e. 

Arguing precisely as in the previous case, we find i,j G {0, ...,no} with i < j and 
such that fi{Ay. HAy.) ^ f?^. It is then easy to see that F — {vi,Vj} is as desired. 
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Case 3: we have that r — ((ao, bo), (a^-i, (*, b) where |t| — 1 = ^ G [m— 1] 

and ap, 6o, a^-i, b G [k]. For every i e {0, no} define 

(10.20) V, = (6^)-(a;j°-)-(fel)-(ar-)-...-(6^_i)-(a^'-^)-(6^) 

and notice tliat Vi £ [fc]no-^+* c [fej^'^^^. Arguing as above, we find i,j £ {0, no} 
with i < j and such that fJ,{Ay. n A^,^) ^ i?^. We set F — {vi,Vj}. Clearly it is 
enough to show that F £ PairsT-([fc]^'^^^). To this end we observe that, by (|10.20p . 
the "representation" K{F) of F is the word 

(10.21) ib},r{iao,b„y-Tia-°-=r... 

..ribur{iai-i,b,-^y-Tic^7-?ribT{(.'^,by-')- 

It follows that the type of F is r and, in particular, that F £ PairsT-([fc]<''+^). 

Case 4: we have that r — {{ao, bo), (a^-i, be-i)) (a, *) where \t\ — 1 — £ £ [m—1] 
and ao, bo, a£_i, bf_i,a £ [k]. For every i £ {0, no} let 

(10.22) V, = {bl)-{a-o''~T{b\)^{aT'T-r{bUr{aZTr{a') 

and notice that Vi £ [fc]"'''^+* C [fcj^'^+i. Hence, there exist ^ i < ^ no with 
/i(yl^. D Ay^) ^ and such that, setting F = {vi,Vj}, the "representation" K{F) 
of F is the word 

(10.23) (6j,)-((ao,&o)^-T(C"'r- 

...-(fe^_i)'^((a.-i,&.-i)^'-T(«"VrK)'^(K*F"0- 

Therefore, the pair F is as desired. The above cases are exhaustive and so the 
entire proof is completed. □ 

10.2. Proof of Proposition \TOA\ We set 

(10.24) Cor2(fc, TO, e) = FK{k, + k, \{e^ - i?^)-!] . (m + 1), 2) 

and we claim that with this choice the result follows. 

Indeed, fix a Carlson-Simpson tree W of [fc]^'^ with dim(VF) ^ Cor2(fc, to, e) 
and a family {A^ : w £ W} of measurable events in a probability space (51, 
satisfying ^(A„) e for every w £ W. We set 

(10.25) r=^F£ Pairs(T/F) : /^( p| A„) ^ 

tueF 

By ()10.24p and Lemma [10.3) there exists a Carlson-Simpson subtree F of with 
dim(l/) — [(e^ — ■d'^)^^^ • (to + 1) and such that for every type r £ T{C) with 
PairSr(V^) ^ we have that either Pairs^(F) C P or Pairs^(F) DV ^ 0. By the 
estimate on the dimension of V and Lemma [10.41 we see that PairsT-(V^) DP ^ 
provided that PairSr(T^) is nonempty and |r| ^ to. Therefore, for every type r with 
t| ^ to we have Pairs,- (t^) C V. Let U be any TO-dimensional Carlson-Simpson 
subtree of V. Clearly U is as desired. 
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11. Free sets 

In this section we discuss a second quantitative refinement of Theorem 11.21 
Specifically our goal is to obtain optimal lower bounds for the correlation of the 
events {At : t E F} in (|1.3|) provided that the set F is free. The class of free sets 
was introduced in j5] and includes various well-known subsets of Carlson-Simpson 
trees such as all finite chains, all doubletons and many more. Its definition and 
main properties are recalled in §11.1. In §11.2 we state the main result of this 
section, Theorem 1 1 1 .41 below. The proof of Theorem II 1.41 is given in §11.3. 

11.1. Free sets: definitions and main properties. We start with the following 
definition (see jS] Definition 6.1]). 

Definition 11.1. Let fc G N with fc ^ 2. Recursively, for every integer n ^ 1 we 
define a family FTn{[k]'^^^) as follows. First let FTi{[k]'^^) and Fr2([fc]^^) consist of 
all singletons and all doubletons of [k]"^^ respectively. Let n G N with n ^ 2 and 
assume that the family Fr„([fc]^'*') has been defined. Then Fr„+i([fc]^^) consists of 
all subsets of [fc]^^ which can be written in the form {w} U G where w G [fc]^'^ and 
G G Fr„([fc]<'^) are such that |w| < |A G|. We set 

(11.1) Fr([fc]<^) = U Fr„([fc]<«). 

An element o/Fr([fc]^^) will be called a free subset of [fc]^^. 

We have the following characterization of free sets. Its proof if straightforward. 

Fact 11.2. Let fc G N with fc ^ 2. Also let F be a finite subset of [k]<^'^ with 
\F\ ^ 3. Then F is free if and only if there exists an enumeration {wi, ...,Wn} of 
F such that 

(1) |wi| < \W2\ < ... < < \wn\ and 

(2) \wt\ < I A {wi+i, Wn}\ for every i G {1, ...,n - 2}. 

For every Carlson-Simpson tree W of [fc]<^ by Fr(W^) we shall denote the set 
of all free subsets of [fc]^'*' which are contained in W. Moreover, for every integer 
n ^ 1 we set 

(11.2) FTniW) = {F eFT{W) : \F\=n}. 

We isolate, for future use, the following fact. Its proof is also straightforward. 

Fact 11.3. Let fc,d G N with k ^ 2 and d ^ 1. Also let W be a d-dimensional 
Carlson-Simpson tree of [k]"^^ . Then the following hold. 

(a) For every F C [fc]<'^+-'^ we have that F is free if and only iflwiF) is free, 
where Iw is the canonical isomorphism associated to W . 

(b) For every integer n ^ 1 we have that Fr„(VF) ^ if and only if n — 1 ^ d. 

(c) For every Carlson-Simpson subtree V ofW we have thatFTn{V) C Fr„(VF). 
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11.2. The main result. We have the fohowmg extension of Proposition llO.il 

Theorem 11.4. Let fc, m, n G N with fc > 2 and m + 1 ^ n ^ 1, and < i) < e ^ 1. 

Then there exists a positive integer Cor„(fc, m, i?, e) with the following property. If 
W is a Carlson-Simpson tree of [k]'^^'^ of dimension at least Cor„(fc, to, i?, e), then 
for every family {A^ : w £ W} of measurable events in a probability space (17, 
satisfying /i(A^) ^ e for every w W there exists a Carlson-Simpson subtree U of 
W with dim{U) — m and such that for every F G Fr„(C/) we have 

(11.3) M(f|A^)^i9". 

By iterating Theorem lll.4| we can control simultaneously all free subsets of a 
Carlson-Simpson tree. Specifically, for every fc, m G N with fc ^ 2 and to 1, and 
every 0<i?<e^lwe define a sequence (A^i)™^^ in N recursively by the rule 



(11.4) 



Ni = m, 

iV,+i = Cor,+i(fc,7V,,z?,e) 



and we set 

(11.5) CorFr(fc, m, e) = N,n+i. 
We have the following corollary. 

Corollary 11.5. Let fc,TO G N with k ^ 2 and m ^ 1, and 0<i?<e^l. IfW 
is a Carlson-Simpson tree o/[fc]^'*' of dimension at least CorFr(fc, to, i?, e), then for 
every family {A^ : w G W} of measurable events in a probability space (fi, I],/i) 
satisfying ^,{Aw) ^ e for every w there exists a Carlson-Simpson subtree U of 
W with dim(t/) = m and such that for every F G Fr(f7) we have 

(11.6) n 

11.3. Proof of Theorem lll.4l For every probability space (fi, S, /i), every F G S 
with ^{Y) > and every A G S let 

be the conditional probability of A relative to Y. We have the following proposition. 

Proposition 11.6. Let fc, d G N with fc ^ 2 and d ^ 1, and < rj < g ^ 1. Then 
there exists a positive integer Ke\{k,d,r], g) with the following property. If W is 
a Carlson-Simpson tree W of [fc]^^ of dimension at least Ke\{k,d,ri, g), then fi 



or 



every family {Aw : w G W} of measurable events in a probability space (il, I],/i) 
satisfying /i(yl„,) ^ g for every w G W there exists a Carlson-Simpson subtree V 
of W with dmi{V) — d and .such that for every w,v V we have 

(11.8) ^iiAw I A,) > r). 
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Proof. First we introduce some numerical invariants. We set 

(11.9) x^{g.^-^y. 



Notice that A > 1. Also let 
(11.10) 

It is easy to see that 



In g-^ 
In A 



(11.11) < 77 < X'^g < g< \g< X'^g< ... < V'^g < 1 < A>. 
For every i e {0, — 1} let 

(11.12) di^Cor2{k,d,X'-^g,X'g) 
and define 

(11.13) A = max{d, : s$ i ^ r - 1}. 
We set 

(11.14) Rel(fc, d, r], g) = CS(fc, A + 1, 1, r) 

and we claim that with this choice the result follows. 

Indeed let be a Carlson-Simpson tree of [fc]<^ with dim(T4^) ^ Rel(fc, d, ry, g) 
and {Auj : w € W} be a family of measurable events in a probability space (51, S, fi) 
satisfying fi{Aw) ^ g for every w G W. By (|11.14p and Lemma \Tl5[ there exist a 
Carlson-Simpson subtree U oi W with dim(C/) = A and io S {0, ...,r — 1} such 
that for every u € U we have 

(11.15) A'V s; < A^o+V- 
Notice that 

(11.16) dim((7) ^ Coi2{k,d, X'°-^ g, y° g). 

Therefore, by Proposition llO.il there exists a Carlson-Simpson subtree V oiU with 
dim(y) — d and such that 

(11.17) fi{A^nA^,)^ X'^'"~'^g'^ 

for every v,v' G V. By ()11.15p and ()11.17p . and taking into account that A > 1 
and zq J5 0, we conclude that 

(11.18) I = ^^^(^^,) ^ - ^ ,-3^ ^. 
The proof is completed. □ 



We are ready to proceed to the proof of Theorem 111.41 
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Proof of Theorem \11.4\ The proof proceeds by induction on n. Of course the case 
"n = 1" is straightforward, while the case "n = 2" follows from Proposition llO.il 
So let n G N with n ^ 2 and assume that the numbers Cor„(fc, to, i?, e) have been 
defined for any choice of admissible parameters. We fix fc,TO £ N with fc ^ 2 and 
TO ^ n, and < d < e ^ 1. To define the number Cor„+i(fc, m, -d, e) we need to do 
some preparatory work. First let / : N — ?> N be defined by the rule 

(11.19) /(x) = Cor„(fc,^,^,^) 

if x ^ n — 1 and f{x) ~ otherwise. Next we define a sequence (dif^S^^^ in N 
recursively by the rule 



(11.20) 



rfm-)i+l — n — 1, 



where Z*-*^^' stands for the (fc^' ^)-th iteration of /. Finally let 

(11.21) Cor„+i(fc, TO, e) = Rel(^A:, dg, — ^ — , e 

We claim that with this choice the result follows. 

Indeed, fix a Carlson-Simpson tree W of with 

(11.22) dim(H/) ^ Relffc, do, ^A;^.^ 



2 

Also let {Am : w S W} be a family of measurable events in a probability space 
(f2, E,/i) satisfying /i(Au,) £ for every w G VF. By (|11.22[) and Proposition II 1.6[ 
there exists a Carlson-Simpson subtree ^ of with dim(l/) = do and such that 

(11.23) M^„|A„0^'^^^ 



2 

for every w, v' G V . 

Recursively we shall construct a sequence (V^)™ q"^^ of Carlson-Simpson trees 
of [A;]^^ with Vb = ^ and such that for every i G [to — n+1] the following conditions 
are satisfied. 

(CI) We have that Vi is a Carlson-Simpson subtree of Vi-\ of dimension i + di. 
(C2) For every j G {0, - 1} we have that Vi{fj) = Vi-i(j). 
(C3) For every u G Fi(i — 1) and every G G YxniVi) with AG G Vt(i) for some 
j G {i, dini(Vi) — 1} we have 

(11.24) /i( fl Am I A,) ^ 7?". 

We proceed to the construction. Let i G [to — n + 1] and assume that the Carlson- 
Simpson trees Vb, Vi-\ have been selected so that conditions (CI), (C2) and (C3) 
are satisfied. Set 

(11.25) d = dim(V;_i) ^ = ^i-l + dj_i 
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and denote by v = {v,vo, ...,Vd-i) the generating sequence of Vi_i. Let T be the 
Carlson-Simpson tree generated by the Carlson-Simpson sequence (0, Vi, Vd-i)- 
Notice that 

(11.26) dMT)=d-^^d.^,-l^ f('^''-'\dO. 

Also observe that \V^-i{i-l)\ = fc^^^ and |Vi-i(i)| ^ k\ By (lll.23p . the definition 
of the map / in (|11.19l) and using our inductive assumptions successively fc^'~^ 
many times, we find a Carlson-Simpson subtree S oi T with dim(S') = and such 
that for every v S Vi^i{i — 1), every t £ Vi_i(«) and every G G Fr„(i'"S') we have 

(11.27) /i( n 1 ^ 

weG 

We set 

i-l 

(11.28) V,= (^\Jv,^iU))^{ U ^'^^ 

j=o tei^.-i(i) 

It is easily checked that Vi satisfies condition (C1)-(C3). The recursive selection is 
thus completed. 

We are ready for the final step of the argument. We set U = Vm-n+i and we 
observe that 

(11.29) dim(C/) = m — n + 1 + dm-n+i = m. 

Let H e Fr„+i([/) be arbitrary. Write H as {w} U G where v gU and G £ Fr„(?7) 
are such that \v\ < | A G|. By Fact 111.31 and (|11.29p . we see that there exists 
io e {0, TO — n + 1} such that w G [/(iq — 1). Invoking conditions (CI) and (C2), 
we get that v £ Vig{io — 1), G £ Fr„(Vi„) and AG £ Vig{j) for some j > io. Hence, 
by (C3), we conclude that 

/ ^ \ / ^ \ 111.241 1 

(11.30) f] A^j = fii^ f] \ A,j ■ ^iiA,) > d^-e>d^+\ 

weH w£G 

This completes the proof of the general inductive step and so the entire proof is 
completed. □ 

12. Comments 

Let /c, n, TO £ N with fc ^ 2 and n ^ m ^ 1. A subset of [fc]" of the form 
(12.1) {?z;(ao, ...,a„i_i) : ao, a,„_i £ [k]} 

where w is an m-variable word over k of length n, is said to be an m- dimensional 
combinatorial subspace of [fc]". We point out that all the results in this paper remain 
valid for the category of combinatorial subspaces. Indeed, any of our statements 
about Carlson-Simpson trees implies, naturally, its counterpart for combinatorial 
subspaces. For instance, Theorem 11.21 has the following consequence. 



MEASURABLE EVENTS INDEXED BY WORDS 



41 



Corollary 12.1. For every /c, m e N with fc ^ 2 and m ^ 1, and every < £ ^ 1 
let Gor{k, e,m) be as in Theorem \1.2[ If N ^ Cor(fc, e, m) and {A^ : w G [fc]^} is 
a family of measurable events in a probability space (17, satisfying fj,{Aw) ^ e 

for every w G [k]^ , then there exists an m-dimensional combinatorial subspace V 
of \k]^ such that for every n ^ {1, ...,\V\} and every subset F of V of cardinality 
n we have 

(12.2) /x( fl A„) ^0(fc,e,n) 

weF 

where 9{k,e,n) is as in Theorem ll.Si 

Proof. As we have already indicated, we will reduce the proof to Theorem II .21 We 
will argue as in [71 Proposition 11.13]. Specifically, fix an integer N > Cor(fc,£,m) 
and a family {A^ : w G [k]^} of measurable events in a probability space (fi, E, n) 
satisfying fJ.{A^) ^ e for every w G [k]^ . For every i e {0, N} and every t G [ky 
let i = t-^l^^* e [fc]^, where 1^"* is as in (|10.12p . and define 

(12.3) At = Af. 

By Theorem II. 2[ there exists an m-dimensional Carlson-Simpson subtree S of 
[fc]^^+^ such that for every nonempty F 'Z S we have 

(12.4) ^,(^f]At)^e{k,e,\F\). 

teF 

We set V = {i : t e S{m)}. Clearly V is as desired. □ 

Appendix A. 

Our goal in this appendix is to give the proof of the following quantitative 
strengthening of Theorem 12.21 

Theorem A.l. There exists a primitive recursive function ip : N"^ ^ N belonging 
to the class £^ of Grzegorczyk's hierarchy such that 

(A.l) GR{k,d,m,r) ^ ip{k,d,m,r) 

for every integer k ^ 2, every pair of integers d ^ m ^ 1 and every integer r ^ I. 

Theorem lA.il is based on several intermediate results which are of independent 
interest. All of them are well-known - see, e.g., [Ill \17\ \19\ [20] - and follow from 
the Hales- Jewett Theorem. We recall this material in §A.l and §A.2. The proof of 
Theorem lA.il is given in §A.3. 
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A.l. Folkman's Theorem. Let T = (Fo,...,F„) be a nonempty finite sequence 
of pairwise disjoint nonempty finite subsets of N. Tlie finite unions of J- is defined 
to be the set 



The following result is known as Folkman's Theorem. 

Theorem A.2. For every d, r G N with d,r ^ 1 there exists a positive integer N 
with the following property. If n ^ N , then for every sequence J- — (i^o, Fn-i) of 
pairwise disjoint nonempty finite subsets ofN and every r -coloring o/FU(J^) there 
exists a sequence Q = (Go, Gd-i) in FU(J^) consisting of pairwise disjoint sets 
such that the set FU(CJ) is monochromatic. The least integer N with this property 
will be denoted by F((i, r). 

We will sketch a proof of Theorem I A . 2 1 emphasizing . in particular, the bounds we 
get from the argument. To this end we need to introduce some numerical invariants. 
Specifically, for every /c, m, r G N with k ^ 2 and m,r ^ 1 let MHJ(A:, m, r) be the 
"multidimensional Hales- Jewett number" for the parameters k,m,r. It is defined 
to be the least integer N ^ m such that for every n ^ N and every r-coloring of 
[fc]" there exists a monochromatic m-dimcnsional combinatorial subspace of [fc]". 
It is well-known - see, e.g., 031 Proposition 4.1] - that 



Next, for every pair d, r of positive integers we define a sequence (n^) in N recursively 
by the rule 



We have the following proposition. 

Proposition A. 3. For every pair d,r of positive integers we have F((i, r) ^ n^r- 
In particular, there exists a primitive recursive function ■f/^i : — S> N belonging to 
the class £^ of Grzegorczyk's hierarchy such that F(d, r) ijjiijl^r). 

We will need the following fact. It is a straightforward consequence of the defi- 
nition of the "multidimensional Hales- Jewett numbers" . 

Fact A.4. Let n,r be a pair of positive integers and set N — MHJ(2, n, r). Also let 
J- = {Fq, Fn-i) be a sequence of pairwise disjoint nonempty finite subsets ofN. 
Then for every r-coloring o/FU(7^) there exist G G {0} U FU(J^) and a sequence 
Q = (Go, G„_i) in FU(7^) consisting of pairwise disjoint subsets ofN\G such 
that the set {G U H : H G FU(^)} is monochromatic. 

We proceed to the proof of Proposition IA.3I 



(A.2) 




(A.3) 



MHJ(2,TO,r) 



(A.4) 
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Proof of Proposition lA.^ Let J-" — {Fq, Fn^^_i) be a sequence of pairwise dis- 
joint nonempty finite subsets of N. Fix a coloring c : F\J{F) [r] and set Gdr ~ 
and Qdr = T . Using Fact I A. 4] and backwards induction, for every i G {0, — 1} 
we may select sets G^, Gg \ G^'^Li such that, setting Qi = (Gq \ G^j^Li), the 
following conditions are satisfied. 

(CI) We have that Gj G {0} U FU(^i_|_i). Moreover, the sequence Qi is in 

FU(t/j+i) and consists of pairwise disjoint subsets of N \ G^. 
(C2) The set {Gi U H : H ^ F\J{Qi)} is monochromatic with respect to c. 

The construction is, of course, straightforward. Next we distinguish the following 
(mutually exclusive) cases. 

Case 1: for every i G {0, ...,dr — 1} the set d is nonempty. We set 
(A.5) ^'-(Go,...,Gd,_i). 

By condition (CI), the sequence J-' is in FU(J-') and consists of pairwise disjoint 
sets. On the other hand, by condition (C2), for every pair Si and 52 of nonempty 
subsets of {0, dr — 1} with max(5i) = max(S'2) we have that 

(A.6) c( U G.) =c( y G,). 

Therefore, by the classical pigeonhole principle, there exist io < ii < ... < id-i in 
{0, dr — 1} such that the set FU((Gio, ^id-i)) is monochromatic. 

Case 2: there exists io £ {0,...,dr — 1} with Gi„ = 0. Notice that Ui^ ^ d. By 
conditions (Cl) and (C2), the sequence G = {G^°\ ...^G^^^^) is as desired. 

This completes the proof that F{d,r) ^ Udr- Finally, plugging in this upper 
bound the estimate in ()A.3|) and using elementary properties of primitive recursive 
functions (see, e.g., |22|). we see that there exists a primitive recursive function 
■01 : N belonging to the class such that F(d, r) ^ 4'i{d,r). The proof is 

completed. □ 

We will need a slight variant of Folkman's Theorem which is, essentially, the 
finite version of Hindman's Theorem [16j. To state this variant it is convenient 
to introduce some terminology. Specifically, let n ^ 1 and T = {Fq, ...,F„) be a 
sequence of pairwise disjoint nonempty finite subsets of N. We say that is a block 
sequence if max(i^i) < min(_Fi+i) for every i G {0, ...,n — 1}. If G [n] and J-" is a 
block sequence of length n, then a sequence G = (Gq, ...,Gd) is said to be a block 
subsequence of if ^/ is a block sequence and Gi G FU(J^) for every i G {0, ...,d}. 
We have the following corollary. 

Corollary A.5. For every d, r G N with d,r ^ 1 there exists a positive integer N 
with the following property. If n ^ N , then for every block sequence F of length n 
and every r-coloring ofF\]{F) there exists a block subsequence G of F of length d 
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such that the set F\J{Q) is monochromatic. The least integer N with this property 
will be denoted by H(d, r). 

Moreover, there exists a primitive recursive function ■02 : — >■ N belonging to 
the class £^ of Grzegorczyk's hierarchy such that H((i, r) ^ ^p2{d,r). 

Proof. For every triple d, m, r of positive integers with d ^ m let R(d, m, r) be the 
Ramsey number for the parameters d, m, r. It is defined to be the least integer 
n ^ d such that for every coloring c : {F C [n] : \F\ = m} [r] there exists L C [n] 
with \L\ = d such that the set {F L : \F\ — m} is monochromatic. It follows from 
well-known estimates - see, e.g., [Hllllllin] ~ that the Ramsey numbers are bounded 
by a function belonging to the class Also notice that if n ^ R{d + m, m, r™), 
then for every coloring c : {F C [n] : \F\ ^ m} — > [r] there exists L C [n] with 
\L\ = d such that for every i e [m] the coloring c is constant on {F L : \F\ — i}. 
Using this observation, we see that 

(A.7) H(d, r) < r(2 • F{d, r),F{d, r), r^^''''-) ' 

and the proof is completed. □ 

A. 2. The finite version of the Milliken Taylor Theorem. The following re- 
sult is the finite version of the Milliken-Taylor Theorem [THl [53] . 

Theorem A. 6. Let d,m,r £ N with d ^ m ^ 1 and r ^ 1. Then there exists 
a positive integer N with the following property. If n ^ N , then for every block 
sequence T of length n and every r-coloring of the set of all block subsequences of 
T of length m there exists a block subsequence Q of T of length d such that the set 
of all block subsequences of Q of length m is monochromatic. The least integer with 
this property will be denoted by MT(d,m,r). 



Just as in §A.l, we will sketch a proof of Theorem IA.6I focusing on the bounds 
we get from the argument for the numbers MT{d, to, r). To this end, first, we define 
a function /i : — > N recursively by the rule 



(A 



J h{0,e,m,r) ^0, 

j h{i + l,e,m,r)= }l(ji{i, i, TO, r) -|- 1, ^2(™+«)™y 
Next, we define a function / : N'^ — > N also recursively by 



(A.9) 



f{d,l,r) = R{d,r), 

f{d, m + l,r) = m + h(^f{d, to, r) — m, f{d, m, r) — to, to, r) . 



Above we use the convention that h{i,£,m,r) = 0, respectively f{d,m,r) — 0, if 
some of the arguments is 0. We have the following proposition. 

Proposition A.7. For every d,m,r €z N with d ^ m ^ 1 and r ^ I we have 

(A.IO) MT(d, TO, r) f{d, m, r). 
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In particular, the numbers MT((i, m, r) are bounded by a primitive recursive function 
belonging to the class £^ of Grzegorczyk's hierarchy. 

Proof. First we will show that the estimate in (|A.10| is satisfied. We will argue 
by induction on m. The case "to — 1" is straightforward. Let d,m,r G N with 
(i ^ TO + 1 ^ 1 and r ^ 1 and assume that MT{d,m,r) ^ f{d,m,r). We set 
£ = MT(d, TO,r) — TO and Ni = h{i,i,m,r) for every i e {0,. ..,£}. Clearly, it is 
enough to prove that MT{d, m + l,r) ^ m + N^. 

To this end, fix a block sequence J- of length m + Ng and a coloring c of the set 
of all block subsequences of J- of length m + 1. By backwards induction, we may 
select a family (J-i)f^i of block sequences satisfying the following properties. 
(CI) The length of J^^ is m + £ — i + Ni for every i e [£]. Moreover, ~ T . 
(C2) The sequence Ti is a block subsequence of Ti^\ for every i G [£ — 1]. 

Moreover, Ti and agree on the first to + £ — i — 1 elements. 
(C3) Let i G [£ — 1] and (Go, G,„_i) be a block subsequence of length m of the 

first TO + — i — 1 members of Ti . Also let F and F' be nonempty unions of 

the last Ni + \ members of Ti. Then, the block sequences (Go, G,n~i,F) 

and (Go, Gm-i,F') have the same color. 
Such a selection can be easily carried out using Corollary IA.5I and invoking the 
definition of the function h in (jA.8|) . By condition (CI) and the choice oi £, it 
is easily verified that the length of the block sequence Fi is MT{d,m,r). Next 
observe that, by condition (C3), the color of a block subsequence Q of Fi of length 
TO + 1 depends only on the first m elements of Q. By our inductive assumptions 
and taking into account the above remarks, there exists a block subsequence Ti of 
Fi of length d such that the set of all block subsequences of H of length m + 1 is 
monochromatic with respect to c. 

Finally notice that, by Corollary IA.51 the function h is bounded by a primitive 
recursive function belonging to the class £^ and, therefore, the function / is bounded 
by a function belonging to the class The proof is completed. □ 

A. 3. Proof of Theorem lA.li Fix the parameters k,d,m,r and set 
(A.ll) £ = MT(d,TO,r). 

Define recursively a sequence (-/Vi)i=o ^ ^^^^ 
/ A^o = 0, 

Notice that, by Proposition I A. 71 and the bounds for the "Hales- Jewett numbers" 
obtained in [53], it is enough to show that GR(fc, d, to, r) ^ Ng. To this end, let c be 
an r-coloring of the set of all m- variable words over k of length Ng. By backwards 
induction and using the Hales- Jewett Theorem, we may select a sequence (i^Oi^o 
of variable words over k such that for every i G {0, ...,£ — 1} the following conditions 
are satisfied. 
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(CI) The variable word Wi has length iVi+i — Ni. 

(C2) Let V be an m-variable word over k of length Ni + £ — i — 1 and for every 
j e {0, Ni + £ — i} let Vj be the j-th coordinate of v. Then the set 

{{vq, .■.,VN^_iywi{a)'^Wi+i{vN.)'^w^+2{vN,+iy--'^we_i{vNi+e-i-2) ■ a £ [k]} 

is monochromatic with respect to c. 

The selection is fairly standard and the details are left to the reader. 

We will isolate, for future use, a property guaranteed by the above selection. 
To state it we need to introduce some pieces of notation. Specifically, for every 
m-variable word u — {uq, over k of length £ and every i G {0, m — 1} 

let Si{u) — {j e {0, ...,£ — 1} : uj = Xi] and define 

(A.13) S{u)^{So{u),...,S.m-i(u)). 

Notice that S{u) is a block sequence in {0, ...,£ — 1} of length m. If u happens to 
be a variable word (that is, if m = 1), then the set S{u) is referred to the literature 
as the wildcard set of u. 

Fact A. 8. For every pair u = (uq, itf^i) and v ~ (wq, w^-i) of m-variable 
words over k of length £ with S{u) = S{v) we have that 

(A.14) c(wa{uo)'^...^wi-i{ui^i)) ^ c(woivoy —^Wi-iivi^i)) . 

Fact IA.8I follows easily by condition (C2). It implies, in particular, that the 
coloring c induces an r-coloring c of the set of all block sequences in {0,...,£ — 1} 
of length m. By the choice of £ in (jA.lip . there exists a block sequence S in 
{0, 1} of length d such that the set of all block subsequence of S of length m 

is monochromatic with respect to c. Let u = (uq, be an arbitrary d-variable 

word over k of length £ such that S{u) = S and set w = wo{uq)'^ ...'"wi-i{ue-i). It 
is easy to check that w is a d-variable word over k of length iV^ such that the set 
of all m-variable subwords of w is monochromatic with respect to c. The proof of 
Theorem lA.il is thus completed. 

Appendix B. 

This appendix is devoted to the proof of the following proposition. 
Proposition B.l. For every k, d,m,r E N with fc^2, d^m^l and r ^ 1, 
(B.l) CS(fc, d, m, r) < GR(fc, d -I- 1, m + 1, r) - 1. 

By Theorem I A. 1 1 and Proposition lB.il we have the following corollary. 

Corollary B.2. The numbers CS{k,d,m,r) are bounded by a primitive recursive 
function belonging to the class £^ of Grzegorczyk 's hierarchy. 
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For the proof of Proposition IB.ll we need to do some preparatory work. Let 
w = (w, Wo, w„-i) be an n-dimensional Carlson-Simpson sequence over k. Also 
let m G [n\. Firstly, for every m- variable word z over k of length at most n we define 
a sequence (?^^)^Lo ^ follows. Let £ G [n] be the length of z and write the 
m- variable word z as {zq, zi^i). We set Um = and for every i G {0, m — 1} 
let rii = min{j G {0, — 1} : Zj = Xi}. Notice that 

(B.2) < no < ni < ... < nm-i < nm — £■ 

Next, fix a variable x and for every j G {0,...,^ — 1} set z'^ = Zj if Zj G [/c], and 
= a; otherwise. Clearly, the sequence z' = (zq, z^.^) is a variable word over k 
with the same length as z. Let 

(B.3) V = w'"ii;o(2;o)'"...'"w„o_i(2„o-i) G [fc]<'^ 

with the convention that w is ui if np = 0. Moreover, for every i G {0, m — 1} let 

(B.4) = WnAZnX-'~'^"^+i-'L(^n, + i-l) 

and notice that is a left variable word over k. We set 

(B.5) W(E) Z ^ {v,Vo,...,Vm-l)■ 

lt is clear from what we have said that w (g) z is an m-dimensional Carlson-Simpson 
subsequence of w. Finally, for every (m + l)-variable word u over k let u* to 
be unique m-variable word which is obtained by restricting u just before the first 
appearance of the last variable. 

We summarize below some properties guaranteed by the above constructions. 
All of them are straightforward consequences of the relevant definitions. 

Fact B.3. Let w be an n- dimensional Carlson-Simpson sequence over k. Also let 
m,d £ [n] with to ^ d and y be a {d+ 1) -variable word over k of length n + 1. Then 

(B.6) Subseq^(w) = |w (E> z : z is an m-variable word of length at most n} 

= ® u* : u is an (m + 1) -variable word of length n + l} 

and 

(B.7) Subseq,„(w ® y*) = |w (X) u* : u is an (to + l)-variable subword of y} . 

We are ready to proceed to the proof of Proposition lB.il 

Proof of Proposition \B.1[ Let w be an Carlson-Simpson sequence over k such that 
dim(w) > GR(fc,d + l,m + l,r) — 1 and fix an r-coloring c : Subseq^(w) — s> [r]. 
Denote by n the dimension of w. By Fact lB.3l we may define an r-coloring c of the 
set of all (to + l)-variable words over k of length n + 1 by the rule c{u) = c(w (g) tt*). 
By Theorem 1 2 . 2 1 and the estimate on dim(w), there exists a (d + l)-variable word 
y over k of length n + 1 such that the set of all (m + l)-variable subwords of y is 
monochromatic with respect to the coloring c. Setting v = wcg)?/*, by Fact IB. 3l and 
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the definition of c, we see that the set Subseq^(v) is monochromatic with respect 
to c. The proof is completed. □ 
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